ZINC FINGER BINDING DOMAINS FOR GNN

Technical Field of the Invention

The field of this invention is zinc finger protein binding to target nucleotides. More
particularly, the present invention pertains to amino acid residue sequences within the
α-helical domain of zinc fingers that specifically bind to target nucleotides of the formula
5'-(GNN)-3'.

Background of the Invention 

The paradigm that the primary mechanism for governing the expression of genes
involves protein switches that bind DNA in a sequence specific manner was established in
1967 (Ptashne, M. (1967) Nature (London) 214, 323-4). Diverse structural families of DNA
binding proteins have been described. Despite a wealth of structural diversity, the Cys2-His2
zinc finger motif constitutes the most frequently utilized nucleic acid binding motif in
eukaryotes. This observation is as true for yeast as it is for man. The Cys2-His2 zinc finger
motif, identified first in the DNA and RNA binding transcription factor TFIIIA (Miller, J.,
McLachlan, A. D. & Klug, A. (1985) Embo J 4, 1609-14), is perhaps the ideal structural
scaffold on which a sequence specific protein might be constructed. A single zinc finger
domain consists of approximately 30 amino acids with a simple ββα fold stabilized by
hydrophobic interactions and the chelation of a single zinc ion (Miller, J., McLachlan, A. D. &
Klug, A. (1985) Embo J 4, 1609-14, Lee, M. S., Gippert, G. P., Soman, K. V., Case, D. A. &
Wright, P. E. (1989) Science 245, 635-7). Presentation of the α-helix of this domain into the
major groove of DNA allows for sequence specific base contacts. Each zinc finger domain
typically recognizes three base pairs of DNA (Pavletich, N. P. & Pabo, C. O. (1991) Science
(Washington, D. C., 1883-) 252, 809-17, Elrod-Erickson, M., Rould, M. A., Nekludova, L. &
Pabo, C. O. (1996) Structure (London) 4, 1171-1180, Elrod-Erickson, M., Benson, T. E. &
Pabo, C. O. (1998) Structure (London) 6, 451-464, Kim, C. A. & Berg, J. M. (1996) Nature
Structural Biology 3, 940-945), though variation in helical presentation can allow for
recognition of a more extended site (Pavletich, N. P. & Pabo, C. O. (1993) Science
(Washington, D. C., 1883-) 261, 1701-7, Houbaviy, H. B., Usheva, A., Shenk, T. & Burley,
S. K. (1996) Proc Natl Acad Sci U S A 93, 13577-82, Fairall, L., Schwabe, J. W. R.,
Chapman, L., Finch, J. T. & Rhodes, D. (1993) Nature (London) 366, 483-7, Wuttke, D. S.,
Foster, M. P., Case, D. A., Gottesfeld, J. M. & Wright, P. E. (1997) J. Mol. Biol. 273,
183-206). In contrast to most transcription factors that rely on dimerization of protein domains
for extending protein-DNA contacts to longer DNA sequences or addresses, simple
covalent tandem repeats of the zinc finger domain allow for the recognition of longer
asymmetric sequences of DNA by this motif.

We have recently described polydactyl zinc finger proteins that contain 6 zinc finger
domains and bind 18 base pairs of contiguous DNA sequence (Liu, Q., Segal, D. J., Ghiara,
J. B. & Barbas III, C. F. (1997) PNAS 94, 5525-5530). Recognition of 18 bp of DNA is
sufficient to describe a unique DNA address within all known genomes, a requirement for
using polydactyl proteins as highly specific gene switches. Indeed, control of both gene
activation and repression has been shown using these polydactyl proteins in a model
system (Liu, Q., Segal, D. J., Ghiara, J. B. & Barbas III, C. F. (1997) PNAS 94, 5525-5530).

Since each zinc finger domain typically binds three base pairs of sequence, a complete
recognition alphabet requires the characterization of 64 domains. Existing information which
could guide the construction of these domains has come from three types of studies:
structure determination (Pavletich, N. P. & Pabo, C. O. (1991) Science (Washington, D. C.,
1883-) 252, 809-17, Elrod-Erickson, M., Rould, M. A., Nekludova, L. & Pabo, C. O. (1996)
Structure (London) 4, 1171-1180, Elrod-Erickson, M., Benson, T. E. & Pabo, C. O. (1998)
Structure (London) 6, 451-464, Kim, C. A. & Berg, J. M. (1996) Nature Structural Biology 3,
940-945, Pavletich, N. P. & Pabo, C. O. (1993) Science (Washington, D. C., 1883-) 261,
1701-7, Houbaviy, H. B., Usheva, A., Shenk, T. & Burley, S. K. (1996) Proc Natl Acad Sci U
S A 93, 13577-82, Fairall, L., Schwabe, J. W. R., Chapman, L., Finch, J. T. & Rhodes, D.
(1993) Nature (London) 366, 483-7, Wuttke, D. S., Foster, M. P., Case, D. A.,
Gottesfeld, J. M. & Wright, P. E. (1997) J. Mol. Biol. 273, 183-206, Nolte, R. T., Conlin, R.
M., Harrison, S. C. & Brown, R. S. (1998) Proc. Natl. Acad. Sci. U. S. A. 95, 2938-2943,
Narayan, V. A., Kriwacki, R. W. & Caradonna, J. P. (1997) J. Biol. Chem. 272, 7801-7809),
site-directed mutagenesis (Isalan, M., Choo, Y. & Klug, A. (1997) Proc. Natl. Acad.
Sci. U. S. A. 94, 5617-5621, Nardelli, J., Gibson, T. J., Vesque, C. & Charnay, P. (1991)
Nature 349, 175-178, Nardelli, J., Gibson, T. & Charnay, P. (1992) Nucleic Acids Res. 20,
4137-44, Taylor, W. E., Suruki, H. K., Lin, A. H. T., Naraghi-Arani, P., Igarashi, R. Y.,
Younessian, M., Katkus, P. & Vo, N. V. (1995) Biochemistry 34, 3222-3230, Desjarlais, J. R.
& Berg, J. M. (1992) Proteins: Struct., Funct., Genet. 12, 101-4, Desjarlais, J. R. & Berg, J.
M. (1992) Proc Natl Acad Sci U S A 89, 7345-9), and phage-display selections (Choo, Y. &
Klug, A. (1994) Proc Natl Acad Sci U S A 91, 11163-7, Greisman, H. A. & Pabo, C. O.
(1997) Science (Washington, D. C.) 275, 657-661, Rebar, E. J. & Pabo, C. O. (1994)
Science (Washington, D. C., 1883-) 263, 671-3, Jamieson, A. C., Kim, S.-H. & Wells, J. A.
(1994) Biochemistry 33, 5689-5695, Jamieson, A. C., Wang, H. & Kim, S.-H. (1996) PNAS
93, 12834-12839, Isalan, M., Klug, A. & Choo, Y. (1998) Biochemistry 37, 12026-33, Wu,
H., Yang, W.-P. & Barbas III, C. F. (1995) PNAS 92, 344-348). All have contributed
significantly to our understanding of zinc finger/DNA recognition, but each has its
limitations. Structural studies have identified a diverse spectrum of protein/DNA interactions
but do not explain whether alternative interactions might be more optimal. Further, while
interactions that allow for sequence specific recognition are observed, little information is
provided on how alternate sequences are excluded from binding. These questions have
been partially addressed by mutagenesis of existing proteins, but the data are always limited
by the number of mutants that can be characterized. Phage-display and selection of
randomized libraries overcomes certain numerical limitations, but providing the appropriate
selective pressure to ensure that both specificity and affinity drive the selection is difficult.
Experimental studies from several laboratories (Choo, Y. & Klug, A. (1994) Proc Natl Acad
Sci U S A 91, 11163-7, Greisman, H. A. & Pabo, C. O. (1997) Science (Washington, D. C.)
275, 657-661, Rebar, E. J. & Pabo, C. O. (1994) Science (Washington, D. C., 1883-) 263,
671-3, Jamieson, A. C., Kim, S.-H. & Wells, J. A. (1994) Biochemistry 33, 5689-5695,
Jamieson, A. C., Wang, H. & Kim, S.-H. (1996) PNAS 93, 12834-12839, Isalan, M., Klug, A.
& Choo, Y. (1998) Biochemistry 37, 12026-33), including our own (Wu, H., Yang, W.-P. &
Barbas III, C. F. (1995) PNAS 92, 344-348), have demonstrated that it is possible to design
or select a few members of this recognition alphabet. However, the specificity and affinity of
these domains for their target DNA was rarely investigated in a rigorous and systematic
fashion in these early studies.
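
The size of the recognition alphabet discussed above follows from simple counting: three
base pairs per domain and four possible bases per position. The following is a minimal
illustrative sketch, not part of the disclosed methods; the variable names are hypothetical.
It enumerates the full 64-member alphabet and the 16-member subset whose 5' base is G.

```python
from itertools import product

BASES = "ACGT"

# All 4**3 = 64 possible three-base subsites (the complete recognition alphabet).
all_triplets = ["".join(p) for p in product(BASES, repeat=3)]

# The subset whose 5' base is fixed as G, i.e. the 5'-GNN-3' family.
gnn_triplets = [t for t in all_triplets if t[0] == "G"]

print(len(all_triplets), len(gnn_triplets))
```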

Since Jacob and Monod questioned the chemical nature of the repressor and
proposed a scheme by which the synthesis of individual proteins within a cell might be
provoked or repressed, specific experimental control of gene expression has been a
tantalizing prospect (Jacob, F. & Monod, J. (1961) J. Mol. Biol. 3, 318-356). It is now well
established that genomes are regulated at the level of transcription primarily through the
action of proteins known as transcription factors that bind DNA in a sequence specific
fashion. Often these protein factors act in a complex combinatorial manner allowing
temporal, spatial, and environmentally-responsive control of gene expression (Ptashne, M.
(1997) Nature Medicine 3, 1069-1072). Transcription factors frequently act both through a
DNA-binding domain which localizes the protein to a specific site within the genome, and
through accessory effector domains which act to provoke (activate) or repress transcription
at or near that site (Cowell, I. G. (1994) Trends Biochem. Sci. 19, 38-42). Effector domains,
such as the activation domain VP16 (Sadowski, I., Ma, J., Triezenberg, S. & Ptashne, M.
(1988) Nature 335, 563-564) and the repression domain KRAB (Margolin, J. F., Friedman,
J. R., Meyer, W. K.-H., Vissing, H., Thiesen, H.-J. & Rauscher III, F. J. (1994) Proc. Natl.
Acad. Sci. USA 91, 4509-4513), are typically modular and retain their activity when they are
fused to other DNA-binding proteins. Whereas genes might be readily controlled by
directing transcription factors to particular sites within a genome, the design of DNA binding
proteins that might be fashioned to bind any given sequence has been a daunting
challenge.

The present disclosure is based on the recognition of the structural features unique to
the Cys2-His2 class of nucleic acid-binding, zinc finger proteins. The Cys2-His2 zinc finger
domain consists of a simple ββα fold of approximately 30 amino acids in length. Structural
stability of this fold is achieved by hydrophobic interactions and by chelation of a single zinc
ion by the conserved Cys2-His2 residues (Lee, M. S., Gippert, G. P., Soman, K. V., Case, D.
A. & Wright, P. E. (1989) Science 245, 635-637). Nucleic acid recognition is achieved
through specific amino acid side chain contacts originating from the α-helix of the domain,
which typically binds three base pairs of DNA sequence (Pavletich, N. P. & Pabo, C. O.
(1991) Science 252, 809-17, Elrod-Erickson, M., Rould, M. A., Nekludova, L. & Pabo, C. O.
(1996) Structure 4, 1171-1180). Unlike other nucleic acid recognition motifs, simple covalent
linkage of multiple zinc finger domains allows the recognition of extended asymmetric
sequences of DNA. Studies of natural zinc finger proteins have shown that three zinc finger
domains can bind 9 bp of contiguous DNA sequence (Pavletich, N. P. & Pabo, C. O. (1991)
Science 252, 809-17, Swirnoff, A. H. & Milbrandt, J. (1995) Mol. Cell. Biol. 15, 2275-87).
Whereas recognition of 9 bp of sequence is insufficient to specify a unique site within even
the small genome of E. coli, polydactyl proteins containing six zinc finger domains can
specify 18-bp recognition (Liu, Q., Segal, D. J., Ghiara, J. B. & Barbas III, C. F. (1997) Proc.
Natl. Acad. Sci. USA 94, 5525-5530). With respect to the development of a universal
system for gene control, an 18-bp address can be sufficient to specify a single site within all
known genomes. Although polydactyl proteins of this type are unknown in nature,
their efficacy in gene activation and repression within living human cells has recently been
shown (Liu, Q., Segal, D. J., Ghiara, J. B. & Barbas III, C. F. (1997) Proc. Natl. Acad. Sci.
USA 94, 5525-5530).
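
The contrast between a 9-bp and an 18-bp address can be illustrated with a rough estimate
of how often a given site would occur by chance. The sketch below is illustrative only and
rests on assumptions that are not part of this disclosure: bases are treated as equally
frequent and independent, both strands are counted, and the genome sizes (about 4.6 million
bp for E. coli and about 3.2 billion bp for the human genome) are approximate figures. The
function name is hypothetical.

```python
def expected_chance_sites(site_length_bp: int, genome_size_bp: int) -> float:
    """Expected number of chance occurrences of one specific site.

    Assumes a random sequence with equal base frequencies and counts
    both DNA strands (hence the factor of 2).
    """
    return 2 * genome_size_bp / 4 ** site_length_bp


# Approximate genome sizes, used here only as illustrative assumptions.
E_COLI_BP = 4_600_000
HUMAN_BP = 3_200_000_000

print(expected_chance_sites(9, E_COLI_BP))   # well above 1: a 9-bp site is not unique
print(expected_chance_sites(18, HUMAN_BP))   # below 1: an 18-bp site is expected to be unique
```

Under these assumptions a 9-bp site is expected roughly 35 times in E. coli, whereas an
18-bp site is expected fewer than 0.1 times in a human-sized genome.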

Brief Summary of the Invention

In one aspect, the present invention provides an isolated and purified zinc
finger-nucleotide binding polypeptide that contains the amino acid residue sequence of any of
SEQ ID NO:1-16. In a related aspect, this invention further provides compositions
comprising from two to about 12 such zinc finger-nucleotide binding polypeptides. The
composition preferably contains from 2 to about 6 polypeptides. In a preferred
embodiment, the zinc finger-nucleotide binding polypeptides are operatively linked,
preferably by an amino acid residue linker having the sequence of SEQ ID NO:111. A
composition of this invention specifically binds a nucleotide target that contains the
sequence 5'-(GNN)n-3', wherein each N is A, C, G, or T, with the proviso that all N's cannot
be C, and where n is preferably 2 to 6. A polypeptide or composition can be further
operatively linked to one or more transcription modulating factors such as transcription
activators or transcription suppressors or repressors. The present invention also provides
an isolated and purified polynucleotide that encodes a polypeptide or composition of this
invention and an expression vector containing such a polynucleotide.

In a still further aspect, the present invention provides a process of regulating the
function of a nucleotide sequence that contains the sequence 5'-(GNN)n-3', where n is an
integer from 1 to 6, the process comprising exposing the nucleotide sequence to an
effective amount of a composition of this invention operatively linked to one or more
transcription modulating factors. The 5'-(GNN)n-3' sequence can be found in the
transcribed region or promoter region of the nucleotide or within an expressed sequence
tag.
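
For illustration only, candidate 5'-(GNN)n-3' sites of the kind described above can be
located in a sequence with a short scan. The sketch below is not part of the disclosed
process; the function name and the example sequence are hypothetical. It searches one
strand, reports overlapping windows, and applies the proviso that all N's cannot be C.

```python
import re


def find_gnn_sites(seq: str, n: int) -> list[tuple[int, str]]:
    """Return (0-based start, site) for each window matching 5'-(GNN)n-3'.

    Only the strand given in `seq` is scanned. Windows in which every N
    is C, i.e. (GCC)n, are skipped to honor the proviso.
    """
    seq = seq.upper()
    # A lookahead makes the scan report overlapping windows as well.
    pattern = re.compile(r"(?=((?:G[ACGT]{2}){%d}))" % n)
    sites = []
    for match in pattern.finditer(seq):
        site = match.group(1)
        variable_bases = {site[i] for i in range(len(site)) if i % 3 != 0}
        if variable_bases != {"C"}:
            sites.append((match.start(), site))
    return sites


# Hypothetical example sequence.
print(find_gnn_sites("ATGCAGTCGGAAGTT", 3))
```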

The present disclosure demonstrates the simplicity and efficacy of a general
strategy for the rapid production of gene switches. With a family of defined zinc finger
domains recognizing sequences of the 5'-GNN-3' subset of a 64 member zinc finger
alphabet, polydactyl proteins specifically recognizing novel 9- or, for the first time, 18-bp
sequences were constructed and characterized. Potent transcription factors were generated
and shown to control both gene activation and repression. Gene activation was achieved
using the herpes simplex virus VP16 activation domain (Sadowski, I., Ma, J., Triezenberg,
S. & Ptashne, M. (1988) Nature 335, 563-564) and a recombinant tetrameric repeat of its
minimal activation domain. Gene repression or silencing was achieved using three effector
domains of human origin, the Krüppel-associated box (KRAB) (Margolin, J. F., Friedman, J.
R., Meyer, W. K.-H., Vissing, H., Thiesen, H.-J. & Rauscher III, F. J. (1994) Proc. Natl.
Acad. Sci. USA 91, 4509-4513), the ERF repressor domain (ERD) (Sgouras, D. N.,
Athanasiou, M. A., Beal, G. J., Jr., Fisher, R. J., Blair, D. G. & Mavrothalassitis, G. J. (1995)
EMBO J. 14, 4781-4793), and the mSIN3 interaction domain (SID) (Ayer, D. E., Laherty, C.
D., Lawrence, Q. A., Armstrong, A. P. & Eisenman, R. N. (1996) Mol. Cell. Biol. 16,
5772-5781). Using luciferase reporter gene assays in human epithelial cells, the data show that
artificial transcriptional regulators, designed to target the promoter of the proto-oncogene
erbB-2/HER-2, can ablate or activate gene expression in a specific manner. For the first
time, gene activation or repression was achieved by targeting within the gene transcript,
suggesting that information obtained from expressed sequence tags (ESTs) may be
sufficient for the construction of gene switches. The novel methodology and materials
described herein promise diverse applications in gene therapy, transgenic organisms,
functional genomics, and other areas of cell and molecular biology.

Brief Description of the Drawing 

In the drawing, which forms a portion of the specification 

FIG. 1 (shown in six panels) shows the binding specificity of regions of zinc finger- 
nucleotide binding polypeptides of the invention. 

Detailed Description of the Invention
I. The Invention

The present invention provides zinc finger-nucleotide binding polypeptides, 
compositions containing one or more such polypeptides and the use of the polypeptides 
and compositions for modulating gene expression. 

II. Compounds

A compound of this invention is an isolated zinc finger-nucleotide binding
polypeptide that binds to a GNN nucleotide sequence and modulates the function of that
nucleotide sequence. The polypeptide can enhance or suppress transcription of a gene,
and can bind to DNA or RNA. A zinc finger-nucleotide binding polypeptide refers to a
polypeptide which is a derivatized form of a wild-type zinc finger protein or one produced
through recombination. A polypeptide may be a hybrid which contains zinc finger domain(s)
from one protein linked to zinc finger domain(s) of a second protein, for example. The
domains may be wild type or mutagenized. A polypeptide includes a truncated form of a
wild type zinc finger protein. Examples of zinc finger proteins from which a polypeptide can
be produced include TFIIIA and Zif268.

A zinc finger-nucleotide binding polypeptide of this invention comprises a unique
heptamer (contiguous sequence of 7 amino acid residues) within the α-helical domain of the
polypeptide, which heptameric sequence determines binding specificity to a target
nucleotide. That heptameric sequence can be located anywhere within the α-helical
domain but it is preferred that the heptamer extend from position -1 to position 6 as the
residues are conventionally numbered in the art. A polypeptide of this invention can include
any β-sheet and framework sequences known in the art to function as part of a zinc finger
protein. A large number of zinc finger-nucleotide binding polypeptides were made and
tested for binding specificity against target nucleotides containing a GNN triplet. The results
of those studies are summarized in FIG. 1. In FIG. 1, the GNN triplet binding specificity for
each peptide is shown in the right-hand column, with the highest specificity shown first and
in boldface. In FIG. 1, SEQ ID NOs: are shown in parentheses. For each particular GNN
(e.g., GAA, shown in the right-hand column of FIG. 1) target, the sequences are listed in
order of decreasing specificity for that triplet.

As shown in FIG. 1, a striking conservation of all three of the primary
DNA contact positions (-1, 3, and 6) was observed for virtually all the clones of a given
target. Although many of these residues were observed previously at these positions
following selections with much less complete libraries, the extent of conservation observed
here represents a dramatic improvement over earlier studies (Choo, Y. & Klug, A. (1994)
Proc Natl Acad Sci U S A 91, 11163-7, Greisman, H. A. & Pabo, C. O. (1997) Science
(Washington, D. C.) 275, 657-661, Rebar, E. J. & Pabo, C. O. (1994) Science
(Washington, D. C., 1883-) 263, 671-3, Jamieson, A. C., Kim, S.-H. & Wells, J. A. (1994)
Biochemistry 33, 5689-5695, Jamieson, A. C., Wang, H. & Kim, S.-H. (1996) PNAS 93,
12834-12839, Wu, H., Yang, W.-P. & Barbas III, C. F. (1995) PNAS 92, 344-348). The
present invention discloses that the teachings of the prior art that the three helical positions
-1, 3, and 6 of a zinc finger domain are sufficient to allow for the detailed description of the
DNA binding specificity of the domain are incorrect.

Typically, phage selections have shown a consensus selection in only one or two of
these positions. The greatest sequence variation occurred at the residues in positions 1
and 5, which do not make base contacts in the Zif268/DNA structure and were expected
not to contribute significantly to recognition (Pavletich, N. P. & Pabo, C. O. (1991) Science
(Washington, D. C., 1883-) 252, 809-17, Elrod-Erickson, M., Rould, M. A., Nekludova, L. &
Pabo, C. O. (1996) Structure (London) 4, 1171-1180). Variation in positions 1 and 5 also
implied that the conservation in the other positions was due to their interaction with the DNA
and not simply the fortuitous amplification of a single clone due to other reasons.
Conservation of residue identity at position -2 was also observed. The conservation of
position -2 is somewhat artifactual; the NNK library had this residue fixed as serine. This
residue makes contacts with the DNA backbone in the Zif268 structure. Both libraries
contained an invariant leucine at position 4, a critical residue in the hydrophobic core that
stabilizes folding of this domain.

Impressive amino acid conservation was observed for recognition of the same
nucleotide in different targets. For example, Asn in position 3 (Asn3) was virtually always
selected to recognize adenine in the middle position, whether in the context of GAG, GAA,
GAT, or GAC. Gln-1 and Arg-1 were always selected to recognize adenine or guanine,
respectively, in the 3' position regardless of context. Amide side chain based recognition of
adenine by Gln or Asn is well documented in structural studies, as is the Arg guanidinium
side chain to guanine contact with a 3' or 5' guanine (Elrod-Erickson, M., Benson, T. E. &
Pabo, C. O. (1998) Structure (London) 6, 451-464, Kim, C. A. & Berg, J. M. (1996) Nature
Structural Biology 3, 940-945, Fairall, L., Schwabe, J. W. R., Chapman, L., Finch, J. T. &
Rhodes, D. (1993) Nature (London) 366, 483-7). More often, however, two or three amino
acids were selected for nucleotide recognition. His3 or Lys3 (and to a lesser extent, Gly3)
were selected for the recognition of a middle guanine. Ser3 and Ala3 were selected to
recognize a middle thymine. Thr3, Asp3, and Glu3 were selected to recognize a middle
cytosine. Asp and Glu were also selected in position -1 to recognize a 3' cytosine, while
Thr-1 and Ser-1 were selected to recognize a 3' thymine.

Selected Zif268 variants were subcloned into a bacterial expression vector, and the
proteins overexpressed (finger-2 proteins, hereafter referred to by the subsite for which they
were panned). It is important to study soluble proteins rather than phage-fusions since it is
known that the two may differ significantly in their binding characteristics (Crameri, A.,
Cwirla, S. & Stemmer, W. P. (1996) Nat. Med. 2, 100-102). The proteins were tested for
their ability to recognize each of the 16 5'-GNN-3' finger-2 subsites using a multi-target
ELISA assay. This assay provided an extremely rigorous test for specificity since there
were always six "non-specific" sites which differed from the "specific" site by only a single
nucleotide out of a nine-nucleotide target. Many of the phage-selected finger-2 proteins
showed exquisite specificity, while others demonstrated varying degrees of crossreactivity.
Some polypeptides actually bound better to subsites other than those for which they were
selected.
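
The count of six single-nucleotide "non-specific" sites follows from the design of the GNN
panel: with the 5' G held fixed, each of the two variable positions can take three
alternative bases. The following minimal sketch, with a hypothetical function name,
enumerates those six neighbors for a given finger-2 subsite; it illustrates the counting
only and is not a description of the assay itself.

```python
BASES = "ACGT"


def single_mismatch_subsites(subsite: str) -> list[str]:
    """GNN subsites differing from `subsite` at exactly one variable position."""
    if len(subsite) != 3 or subsite[0] != "G":
        raise ValueError("expected a three-base subsite of the form GNN")
    neighbors = []
    for pos in (1, 2):  # middle and 3' positions; the 5' G stays fixed
        for base in BASES:
            if base != subsite[pos]:
                neighbors.append(subsite[:pos] + base + subsite[pos + 1:])
    return neighbors


print(single_mismatch_subsites("GAG"))
```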

Attempts were made to improve binding specificity by modifying the recognition helix 
using site-directed mutagenesis. Data from our selections and structural information guided 
mutant design. As the most exhaustive study performed to date, over 100 mutant proteins 
were characterized in an effort to expand our understanding of the rules of recognition. 
Although helix positions 1 and 5 are not expected to play a direct role in DNA recognition, 
the best improvements in specificity always involved modifications in these positions. 
These residues have been observed to make phosphate backbone contacts, which 
contribute to affinity in a non-sequence specific manner. Removal of non-specific contacts
increases the importance of the specific contacts to the overall stability of the complex,
thereby enhancing specificity. For example, the specificity of polypeptides for target triplets
GAC, GAA, and GAG were improved simply by replacing atypical, charged residues in
positions 1 and 5 with smaller, uncharged residues.

Another class of modifications involved changes to both binding and non-binding
residues. The crossreactivity of polypeptides for GGG and the finger-2 subsite GAG was
abolished by the modifications His3Lys and Thr5Val. It is interesting to note that His3 was
unanimously selected during panning to recognize the middle guanine, although Lys3
provided better discrimination of A and G. This suggests that panning conditions for this
protein may have favored selection by a parameter such as affinity over that of specificity.
In the Zif268 structure, His3 donates a hydrogen bond to the N7 of the middle guanine
(Pavletich, N. P. & Pabo, C. O. (1991) Science (Washington, D. C., 1883-) 252, 809-17,
Elrod-Erickson, M., Rould, M. A., Nekludova, L. & Pabo, C. O. (1996) Structure (London) 4,
1171-1180). This bond could also be made with N7 of adenine, and in fact Zif268 does not
discriminate between G and A in this position (Swirnoff, A. H. & Milbrandt, J. (1995) Mol.
Cell. Biol. 15, 2275-87). His3 was found to specify only a middle guanine in polypeptides
targeted to GGA, GGC, and GGT, even though Lys3 was selected during panning for GGG
and GGT. Similarly, the multiple crossreactivities of polypeptides targeted to GTG were
attenuated by modifications Lys1Ser and Ser3Glu, resulting in a 5-fold loss in affinity. Glu3
has been shown to be very specific for cytosine in binding site selection studies of Zif268
(Swirnoff, A. H. & Milbrandt, J. (1995) Mol. Cell. Biol. 15, 2275-87). No structural studies
show an interaction of Glu3 with the middle thymine, and Glu3 was never selected to
recognize a middle thymine in our study or any others (Choo, Y. & Klug, A. (1994) Proc Natl
Acad Sci U S A 91, 11163-7, Greisman, H. A. & Pabo, C. O. (1997) Science (Washington,
D. C.) 275, 657-661, Rebar, E. J. & Pabo, C. O. (1994) Science (Washington, D. C., 1883-)
263, 671-3, Jamieson, A. C., Kim, S.-H. & Wells, J. A. (1994) Biochemistry 33, 5689-5695,
Jamieson, A. C., Wang, H. & Kim, S.-H. (1996) PNAS 93, 12834-12839, Isalan, M., Klug, A.
& Choo, Y. (1998) Biochemistry 37, 12026-33, Wu, H., Yang, W.-P. & Barbas III, C. F.
(1995) PNAS 92, 344-348). Despite this, the Ser3Glu modification favored the recognition
of a middle thymine over cytosine. These examples illustrate the limitations of relying on
previous structures and selection data to understand the structural elements underlying
specificity. It should also be emphasized that improvements by modifications involving
positions 1 and 5 could not have been predicted by existing "recognition codes" (Desjarlais,
J. R. & Berg, J. M. (1992) Proc Natl Acad Sci U S A 89, 7345-9, Suzuki, M., Gerstein, M. &
Yagi, N. (1994) Nucleic Acids Res. 22, 3397-405, Choo, Y. & Klug, A. (1994) Proc. Natl.
Acad. Sci. U. S. A. 91, 11168-72, Choo, Y. & Klug, A. (1997) Curr. Opin. Struct. Biol. 7,
117-125), which typically only consider positions -1, 2, 3, and 6. Only by the combination of
selection and site-directed mutagenesis can we begin to fully understand the intricacies of
zinc finger/DNA recognition.

From the combined selection and mutagenesis data it emerged that specific 
recognition of many nucleotides could be best accomplished using motifs, rather than a 
single amino acid. For example, the best specification of a 3' guanine was achieved using 
the combination of Arg-1, Ser1, and Asp2 (the RSD motif). By using Val5 and Arg6 to
specify a 5' guanine, recognition of subsites GGG, GAG, GTG, and GCG could be
accomplished using a common helix structure (SRSD-X-LVR) differing only in the position 3
residue (Lys3 for GGG, Asn3 for GAG, Glu3 for GTG, and Asp3 for GCG). Similarly, 3'
thymine was specified using Thr-1, Ser1, and Gly2 in the final clones (the TSG motif).
Further, a 3' cytosine could be specified using Asp-1, Pro1, and Gly2 (the DPG motif)
except when the subsite was GCC; Pro1 was not tolerated by this subsite. Specification of a
3' adenine was with Gln-1, Ser1, Ser2 in two clones (QSS motif). Residues of positions 1
and 2 of the motifs were studied for each of the 3' bases and found to provide optimal 
specificity for a given 3' base as described here. 
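
The motif-based description above lends itself to a simple lookup. The sketch below is
illustrative only; the function and table names are hypothetical. It assembles the
heptamer (positions -1 to 6) of the common SRSD-X-LVR helix for the four GNG subsites,
using the position 3 residues named in the text. It assumes that the leading S of
SRSD-X-LVR occupies position -2 and therefore lies outside the heptamer; that reading is
an interpretation, not a statement of the disclosure.

```python
# Position 3 residue (one-letter code) reported for each middle base of a GNG subsite:
# Lys3 for GGG, Asn3 for GAG, Glu3 for GTG, Asp3 for GCG.
POSITION3_FOR_MIDDLE_BASE = {"G": "K", "A": "N", "T": "E", "C": "D"}


def gng_recognition_heptamer(subsite: str) -> str:
    """Return helix positions -1, 1, 2, 3, 4, 5, 6 for a GNG subsite.

    Arg-1, Ser1, Asp2 (the RSD motif) specify the 3' guanine; Leu4 is the
    invariant core residue; Val5 and Arg6 specify the 5' guanine.
    """
    subsite = subsite.upper()
    if len(subsite) != 3 or subsite[0] != "G" or subsite[2] != "G":
        raise ValueError("expected a subsite of the form GNG")
    return "RSD" + POSITION3_FOR_MIDDLE_BASE[subsite[1]] + "LVR"


for target in ("GGG", "GAG", "GTG", "GCG"):
    print(target, gng_recognition_heptamer(target))
```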

The multi-target ELISA assay assumed that all the proteins preferred guanine in the
5' position since all proteins contained Arg6 and this residue is known from structural
studies to contact guanine at this position (Pavletich, N. P. & Pabo, C. O. (1991) Science
(Washington, D. C., 1883-) 252, 809-17, Elrod-Erickson, M., Rould, M. A., Nekludova, L. &
Pabo, C. O. (1996) Structure (London) 4, 1171-1180, Elrod-Erickson, M., Benson, T. E. &
Pabo, C. O. (1998) Structure (London) 6, 451-464, Kim, C. A. & Berg, J. M. (1996) Nature
Structural Biology 3, 940-945, Pavletich, N. P. & Pabo, C. O. (1993) Science (Washington,
D. C., 1883-) 261, 1701-7, Houbaviy, H. B., Usheva, A., Shenk, T. & Burley, S. K. (1996)
Proc Natl Acad Sci U S A 93, 13577-82, Fairall, L., Schwabe, J. W. R., Chapman, L.,
Finch, J. T. & Rhodes, D. (1993) Nature (London) 366, 483-7, Wuttke, D. S., Foster, M. P.,
Case, D. A., Gottesfeld, J. M. & Wright, P. E. (1997) J. Mol. Biol. 273, 183-206, Nolte, R. T.,
Conlin, R. M., Harrison, S. C. & Brown, R. S. (1998) Proc. Natl. Acad. Sci. U. S. A. 95,
2938-2943). This interaction was demonstrated here using the 5' binding site signature
assay (Choo, Y. & Klug, A. (1994) Proc. Natl. Acad. Sci. U. S. A. 91, 11168-72; Fig. 2,
white bars). Each protein was applied to pools of 16 oligonucleotide targets in which the 5'
nucleotide of the finger-2 subsite was fixed as G, A, T, or C and the middle and 3'
nucleotides were randomized. All proteins preferred the GNN pool with essentially no
crossreactivity.

The results of the multi-target ELISA assay were confirmed by affinity studies of
purified proteins. In cases where crossreactivity was minimal in the ELISA assay, a single
nucleotide mismatch typically resulted in a greater than 100-fold loss in affinity. This degree
of specificity had yet to be demonstrated with zinc finger proteins. In general, proteins
selected or designed to bind subsites with G or A in the middle and 3' position had the
highest affinity, followed by those which had only one G or A in the middle or 3' position,
followed by those which contained only T or C. The former group typically bound their
targets with a higher affinity than Zif268 (10 nM), the latter with somewhat lower affinity, and
almost all the proteins had an affinity lower than that of the parental C7 protein. There was
no correlation between binding affinity and binding specificity, suggesting that specificity can
result not only from specific protein-DNA contacts, but also from interactions which exclude
all but the correct nucleotide.

Asp2 was always co-selected with Arg-1 in all proteins for which the target subsite
was GNG. It is now understood that there are two reasons for this. From structural studies
of Zif268 (Pavletich, N. P. & Pabo, C. O. (1991) Science (Washington, D. C., 1883-) 252,
809-17, Elrod-Erickson, M., Rould, M. A., Nekludova, L. & Pabo, C. O. (1996) Structure
(London) 4, 1171-1180), it is known that Asp2 of finger 2 makes a pair of buttressing
hydrogen bonds with Arg-1 which stabilize the Arg-1/3' guanine interaction, as well as some
water-mediated contacts. However, the carboxylate of Asp2 also accepts a hydrogen bond
from the N4 of a cytosine that is base-paired to a 5' guanine of the finger-1 subsite. Adenine
base paired to T in this position can make an analogous contact to that seen with cytosine.
This interaction is particularly important because it extends the recognition subsite of finger
2 from three nucleotides (GNG) to four (GNG(G/T)) (Isalan, M., Choo, Y. & Klug, A. (1997)
Proc. Natl. Acad. Sci. U. S. A. 94, 5617-5621, Jamieson, A. C., Wang, H. & Kim, S.-H.
(1996) PNAS 93, 12834-12839, Isalan, M., Klug, A. & Choo, Y. (1998) Biochemistry 37,
12026-33). This phenomenon is referred to as "target site overlap", and has three important
ramifications. First, Asp2 was favored for selection by our library when the finger-2 subsite
was GNG because our finger-1 subsite contained a 5' guanine. Second, it may limit the
utility of the libraries used in this study to selection on GNN or TNN finger-2 subsites
because finger 3 of these libraries contains an Asp2, which may help specify the 5'
nucleotide of the finger-2 subsite to be G or T. In Zif268 and C7, which have Thr6 in finger
2, Asp2 of finger 3 enforces G or T recognition in the 5' position (T/G)GG. This interaction
may also explain why previous phage display studies, which all used Zif268-based libraries,
have found selection limited primarily to GNN recognition (Choo, Y. & Klug, A. (1994) Proc
Natl Acad Sci U S A 91, 11163-7, Rebar, E. J. & Pabo, C. O. (1994) Science (Washington,
D. C., 1883-) 263, 671-3, Jamieson, A. C., Kim, S.-H. & Wells, J. A. (1994) Biochemistry 33,
5689-5695, Jamieson, A. C., Wang, H. & Kim, S.-H. (1996) PNAS 93, 12834-12839,
Isalan, M., Klug, A. & Choo, Y. (1998) Biochemistry 37, 12026-33, Wu, H., Yang,
W.-P. & Barbas III, C. F. (1995) PNAS 92, 344-348).

Finally, target site overlap potentially limits the use of these zinc fingers as modular
building blocks. From structural data it is known that there are some zinc fingers in which
target site overlap is quite extensive, such as those in GLI and YY1, and others which are
similar to Zif268 and display only modest overlap. In our final set of proteins, Asp2 is found
in polypeptides that bind GGG, GAG, GTG, and GCG. The overlap potential of other
residues found at position 2 is largely unknown; however, structural studies reveal that many
other residues found at this position may participate in such cross-subsite contacts. Fingers
containing Asp2 may limit modularity, since they would require that each GNG subsite be
followed by a T or G.

Table 1, below, summarizes the sequences (SEQ ID NOs: 1-16) showing the highest
selectivity for the sixteen embodiments of GNN target triplets.



Table 1

Target         Amino acid positions    SEQ ID NO:
specificity    -1 1 2 3 4 5 6

GAA            QSSNLVR                 1
GAC            DPGNLVR                 2
GAG            RSDNLVR                 3
GAT            TSGNLVR                 4
GCA            QSGDLRR                 5
GCC            DCRDLAR                 6
GCG            RSDDLVK                 7
GCT            TSGELVR                 8
GGA            QRAHLER                 9
GGC            DPGHLVR                 10
GGG            RSDKLVR                 11
GGT            TSGHLVR                 12
GTA            QSSSLVR                 13
GTC            DPGALVR                 14
GTG            RSDELVR                 15
GTT            TSGSLVR                 16
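The contents of Table 1 can be held as a simple lookup from target triplet to recognition helix. The following is a minimal sketch only; the names GNN_HELICES, helix_for, and seq_id_no are hypothetical helpers introduced for illustration and are not part of the disclosure.

```python
# Recognition helices (positions -1 to 6) transcribed from Table 1.
GNN_HELICES = {
    "GAA": "QSSNLVR", "GAC": "DPGNLVR", "GAG": "RSDNLVR", "GAT": "TSGNLVR",
    "GCA": "QSGDLRR", "GCC": "DCRDLAR", "GCG": "RSDDLVK", "GCT": "TSGELVR",
    "GGA": "QRAHLER", "GGC": "DPGHLVR", "GGG": "RSDKLVR", "GGT": "TSGHLVR",
    "GTA": "QSSSLVR", "GTC": "DPGALVR", "GTG": "RSDELVR", "GTT": "TSGSLVR",
}


def helix_for(triplet: str) -> str:
    """Return the Table 1 helix for a 5'-GNN-3' triplet (hypothetical helper)."""
    key = triplet.upper()
    if key not in GNN_HELICES:
        raise ValueError(f"{triplet!r} is not a 5'-GNN-3' triplet listed in Table 1")
    return GNN_HELICES[key]


def seq_id_no(triplet: str) -> int:
    """SEQ ID NOs 1-16 follow the alphabetical order of the triplets in Table 1."""
    helix_for(triplet)  # validates the triplet first
    return sorted(GNN_HELICES).index(triplet.upper()) + 1


if __name__ == "__main__":
    print(helix_for("GTG"), seq_id_no("GTG"))
```

A dictionary is used because the table is a fixed one-to-one mapping of sixteen entries; triplets outside the GNN family are rejected rather than guessed.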



The data show that all possible GNN triplet sequences can be recognized
with exquisite specificity by zinc finger domains. Optimized zinc finger domains can
discriminate single base differences by greater than 100-fold loss in affinity. While many of
the amino acids found in the optimized proteins at the key contact positions -1, 3, and 6 are
those that are consistent with a simple code of recognition, it has been discovered that
optimal specific recognition is sensitive to the context in which these residues are
presented. Residues at positions 1, 2, and 5 have been found to be critical for specific
recognition. Further, the data demonstrate for the first time that sequence motifs at
positions -1, 1, and 2, rather than the simple identity of the position 1 residue, are required for
highly specific recognition of the 3' base. These residues likely provide the proper
stereochemical context for interactions of the helix both in terms of recognition of specific
bases and in the exclusion of other bases, the net result being highly specific interactions.
Broad utility of these domains would be realized if they were modular in both their
interactions with DNA and other zinc finger domains. This could be achieved by working
within the likely limitations imposed by target site overlap, namely that sequences of the
5'-(GNN)n-3' type should be targeted. Ready recombination of the disclosed domains then
allows for the creation of polydactyl proteins of defined specificity, precluding the need to
develop phage display libraries in their generation. These polydactyl proteins have been
used to activate and repress transcription driven by the human erbB-2 promoter in living
cells. The family of zinc finger domains described herein is likely sufficient for the
construction of 16^6, or 17 million, novel proteins that bind the 5'-(GNN)6-3' family of DNA
sequences.
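The figure of roughly 17 million follows from choosing one of sixteen domains independently at each of six finger positions. A short arithmetic sketch (variable names are illustrative only):

```python
from itertools import product

FINGERS_PER_PROTEIN = 6  # six-finger protein binding a 5'-(GNN)6-3' site

# One domain per 5'-GNN-3' triplet: G followed by any two bases.
gnn_triplets = ["G" + "".join(pair) for pair in product("ACGT", repeat=2)]

# Each finger position can hold any of the sixteen domains independently.
six_finger_proteins = len(gnn_triplets) ** FINGERS_PER_PROTEIN

if __name__ == "__main__":
    print(len(gnn_triplets), six_finger_proteins)
```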

The zinc finger-nucleotide binding polypeptide derivative can be derived or produced
from a wild type zinc finger protein by truncation or expansion, or as a variant of the wild
type-derived polypeptide by a process of site directed mutagenesis, or by a combination of
the procedures. The term "truncated" refers to a zinc finger-nucleotide binding polypeptide
that contains fewer than the full number of zinc fingers found in the native zinc finger binding
protein or that has been deleted of non-desired sequences. For example, truncation of the
zinc finger-nucleotide binding protein TFIIIA, which naturally contains nine zinc fingers,
might be a polypeptide with only zinc fingers one through three. Expansion refers to a zinc
finger polypeptide to which additional zinc finger modules have been added. For example,
TFIIIA may be extended to 12 fingers by adding 3 zinc finger domains. In addition, a
truncated zinc finger-nucleotide binding polypeptide may include zinc finger modules from
more than one wild type polypeptide, thus resulting in a "hybrid" zinc finger-nucleotide
binding polypeptide.

The term "mutagenized" refers to a zinc finger derived-nucleotide binding
polypeptide that has been obtained by performing any of the known methods for
accomplishing random or site-directed mutagenesis of the DNA encoding the protein. For
instance, in TFIIIA, mutagenesis can be performed to replace nonconserved residues in one
or more of the repeats of the consensus sequence. Truncated zinc finger-nucleotide
binding proteins can also be mutagenized.

Examples of known zinc finger-nucleotide binding polypeptides that can be
truncated, expanded, and/or mutagenized according to the present invention in order to
inhibit the function of a nucleotide sequence containing a zinc finger-nucleotide binding
motif include TFIIIA and Zif268. Other zinc finger-nucleotide binding proteins will be known
to those of skill in the art.

A polypeptide of this invention can be made using a variety of standard techniques
well known in the art (See, e.g., United States Patent Application No. 08/676,318, filed
1/18/1995, the entire disclosure of which is incorporated herein by reference). Phage
display libraries of zinc finger proteins were created and selected under conditions that
favored enrichment of sequence specific proteins. Zinc finger domains recognizing a
number of sequences required refinement by site-directed mutagenesis that was guided by
both phage selection data and structural information.

The murine Cys2-His2 zinc finger protein Zif268 is used for construction of phage
display libraries (Wu, H., Yang, W.-P. & Barbas III, C. F. (1995) PNAS 92, 344-348). Zif268
is structurally the most well characterized of the zinc-finger proteins (Pavletich, N. P. &
Pabo, C. O. (1991) Science (Washington, D. C., 1883-) 252, 809-17, Elrod-Erickson, M.,
Rould, M. A., Nekludova, L. & Pabo, C. O. (1996) Structure (London) 4, 1171-1180,
Swirnoff, A. H. & Milbrandt, J. (1995) Mol. Cell. Biol. 15, 2275-87). DNA recognition in each
of the three zinc finger domains of this protein is mediated by residues in the N-terminus of
the α-helix contacting primarily three nucleotides on a single strand of the DNA. The
operator binding site for this three-finger protein is 5'-GCGTGGGCG-3' (the finger-2 subsite
is the central TGG triplet). Structural studies of Zif268 and other related zinc finger-DNA
complexes (Elrod-Erickson, M., Benson, T. E. & Pabo, C. O. (1998) Structure (London) 6,
451-464, Kim, C. A. & Berg, J. M. (1996) Nature Structural Biology 3, 940-945, Pavletich,
N. P. & Pabo, C. O. (1993) Science (Washington, D. C., 1883-) 261, 1701-7, Houbaviy, H. B.,
Usheva, A., Shenk, T. & Burley, S. K. (1996) Proc Natl Acad Sci U S A 93, 13577-82,
Fairall, L., Schwabe, J. W. R., Chapman, L., Finch, J. T. & Rhodes, D. (1993) Nature
(London) 366, 483-7, Wuttke, D. S., Foster, M. P., Case, D. A., Gottesfeld, J. M. & Wright,
P. E. (1997) J. Mol. Biol. 273, 183-206, Nolte, R. T., Conlin, R. M., Harrison, S. C. & Brown,
R. S. (1998) Proc. Natl. Acad. Sci. U. S. A. 95, 2938-2943, Narayan, V. A., Kriwacki, R. W.
& Caradonna, J. P. (1997) J. Biol. Chem. 272, 7801-7809) have shown that residues from
primarily three positions on the α-helix, -1, 3, and 6, are involved in specific base contacts.
Typically, the residue at position -1 of the α-helix contacts the 3' base of that finger's subsite,
while positions 3 and 6 contact the middle base and the 5' base, respectively.
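The pairing of helix positions with subsite bases described above can be written out explicitly. The sketch below assumes the helix is given as the seven residues at positions -1, 1, 2, 3, 4, 5, and 6 (the convention of Table 1) and the subsite is written 5' to 3'; the function name base_contacts is hypothetical, and the pairing is the typical pattern stated in the text, not a structural prediction.

```python
HELIX_POSITIONS = (-1, 1, 2, 3, 4, 5, 6)  # numbering used in the text


def base_contacts(helix: str, subsite: str) -> dict:
    """Pair helix positions -1, 3 and 6 with the 3', middle and 5' bases.

    helix   : seven residues, positions -1 through 6
    subsite : three bases written 5' to 3'
    """
    if len(helix) != len(HELIX_POSITIONS):
        raise ValueError("helix must list the seven residues at positions -1 to 6")
    if len(subsite) != 3:
        raise ValueError("subsite must be a single triplet")
    residue = dict(zip(HELIX_POSITIONS, helix.upper()))
    five_prime, middle, three_prime = subsite.upper()
    return {
        -1: (residue[-1], three_prime),  # position -1 reads the 3' base
        3: (residue[3], middle),         # position 3 reads the middle base
        6: (residue[6], five_prime),     # position 6 reads the 5' base
    }


if __name__ == "__main__":
    # GTG helix from Table 1, used purely as an illustration.
    print(base_contacts("RSDELVR", "GTG"))
```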

In order to select a family of zinc finger domains recognizing the 5'-GNN-3' subset of
sequences, two highly diverse zinc finger libraries were constructed in the phage display
vector pComb3H (Barbas III, C. F., Kang, A. S., Lerner, R. A. & Benkovic, S. J. (1991) Proc.
Natl. Acad. Sci. USA 88, 7978-7982, Rader, C. & Barbas III, C. F. (1997) Curr. Opin.
Biotechnol. 8, 503-508). Both libraries involved randomization of residues within the α-helix
of finger 2 of C7, a variant of Zif268 (Wu, H., Yang, W.-P. & Barbas III, C. F. (1995) PNAS
92, 344-348). Library 1 was constructed by randomization of positions -1, 1, 2, 3, 5, and 6
using an NNK doping strategy, while library 2 was constructed using a VNS doping strategy
with randomization of positions -2, -1, 1, 2, 3, 5, and 6. The NNK doping strategy allows for
all amino acid combinations within 32 codons, while VNS precludes Tyr, Phe, Cys and all
stop codons in its 24-codon set. The libraries consisted of 4.4 x 10^9 and 3.5 x 10^9 members,
respectively, each capable of recognizing sequences of the 5'-GCGNNNGCG-3' type. The
size of the NNK library ensured that it could be surveyed with 99% confidence, while the
VNS library was highly diverse but somewhat incomplete. These libraries are, however,
significantly larger than previously reported zinc finger libraries (Choo, Y. & Klug, A. (1994)
Proc Natl Acad Sci U S A 91, 11163-7, Greisman, H. A. & Pabo, C. O. (1997) Science
(Washington, D. C.) 275, 657-661, Rebar, E. J. & Pabo, C. O. (1994) Science (Washington,
D. C., 1883-) 263, 671-3, Jamieson, A. C., Kim, S.-H. & Wells, J. A. (1994) Biochemistry 33,
5689-5695, Jamieson, A. C., Wang, H. & Kim, S.-H. (1996) PNAS 93, 12834-12839,
Isalan, M., Klug, A. & Choo, Y. (1998) Biochemistry 37, 12026-33). Seven rounds of
selection were performed on the zinc finger displaying-phage with each of the 16
5'-GCGGNNGCG-3' biotinylated hairpin DNA targets using a solution binding protocol.
Stringency was increased in each round by the addition of competitor DNA. Sheared herring
sperm DNA was provided for selection against phage that bound non-specifically to DNA.
Stringent selective pressure for sequence specificity was obtained by providing DNAs of the
5'-GCGNNNGCG-3' types as specific competitors. Excess DNA of the 5'-GCGGNNGCG-3'
type was added to provide even more stringent selection against binding to DNAs with
single or double base changes as compared to the biotinylated target. Phage binding to the
single biotinylated DNA target sequence were recovered using streptavidin coated beads.
In some cases the selection process was repeated. The present data show that these
domains are functionally modular and can be recombined with one another to create
polydactyl proteins capable of binding 18-bp sequences with subnanomolar affinity. The
family of zinc finger domains described herein is sufficient for the construction of 17 million
novel proteins that bind the 5'-(GNN)6-3' family of DNA sequences.
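The codon counts quoted for the two doping strategies can be checked by enumeration. The sketch below expands the degenerate codons NNK and VNS using the standard IUPAC nucleotide codes (N = A/C/G/T, K = G/T, V = A/C/G, S = C/G) and the standard genetic code; the helper name expand is hypothetical, and the calculation covers only codon sets and theoretical sequence space, not the experimental library sizes.

```python
from itertools import product

# Standard genetic code in TCAG order; '*' marks a stop codon.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}

# IUPAC degenerate nucleotide codes used by the two doping strategies.
IUPAC = {"N": "ACGT", "K": "GT", "V": "ACG", "S": "CG"}


def expand(scheme: str):
    """Return (codons, encoded residues) for a degenerate codon such as 'NNK'."""
    codons = ["".join(c) for c in product(*(IUPAC[s] for s in scheme))]
    return codons, {CODON_TABLE[c] for c in codons}


nnk_codons, nnk_residues = expand("NNK")
vns_codons, vns_residues = expand("VNS")

# Theoretical DNA sequence space for each library design.
nnk_space = len(nnk_codons) ** 6   # six randomized positions in library 1
vns_space = len(vns_codons) ** 7   # seven randomized positions in library 2

if __name__ == "__main__":
    print(len(nnk_codons), sorted(nnk_residues))
    print(len(vns_codons), sorted(vns_residues))
    print(nnk_space, vns_space)
```

Because every VNS codon begins with A, C, or G, the enumeration also shows that Trp, whose only codon is TGG, falls outside the VNS set along with Tyr, Phe, Cys, and the stop codons.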

The invention includes a nucleotide sequence encoding a zinc finger-nucleotide
binding polypeptide. DNA sequences encoding the zinc finger-nucleotide binding
polypeptides of the invention, including native, truncated, and expanded polypeptides, can
be obtained by several methods. For example, the DNA can be isolated using hybridization
procedures which are well known in the art. These include, but are not limited to: (1)
hybridization of probes to genomic or cDNA libraries to detect shared nucleotide
sequences; (2) antibody screening of expression libraries to detect shared structural
features; and (3) synthesis by the polymerase chain reaction (PCR). RNA sequences of the
invention can be obtained by methods known in the art (See, for example, Current Protocols
in Molecular Biology, Ausubel, et al., Eds., 1989).

The development of specific DNA sequences encoding zinc finger-nucleotide
binding polypeptides of the invention can be obtained by: (1) isolation of a double-stranded
DNA sequence from the genomic DNA; (2) chemical manufacture of a DNA sequence to
provide the necessary codons for the polypeptide of interest; and (3) in vitro synthesis of a
double-stranded DNA sequence by reverse transcription of mRNA isolated from a
eukaryotic donor cell. In the latter case, a double-stranded DNA complement of mRNA is
eventually formed which is generally referred to as cDNA. Of these three methods for
developing specific DNA sequences for use in recombinant procedures, the isolation of
genomic DNA is the least common. This is especially true when it is desirable to obtain the
microbial expression of mammalian polypeptides due to the presence of introns.

For obtaining zinc finger derived-DNA binding polypeptides, the synthesis of DNA
sequences is frequently the method of choice when the entire sequence of amino acid
residues of the desired polypeptide product is known. When the entire sequence of amino
acid residues of the desired polypeptide is not known, the direct synthesis of DNA
sequences is not possible and the method of choice is the formation of cDNA sequences.
Among the standard procedures for isolating cDNA sequences of interest is the formation of
plasmid-carrying cDNA libraries which are derived from reverse transcription of mRNA
which is abundant in donor cells that have a high level of genetic expression. When used in
combination with polymerase chain reaction technology, even rare expression products can
be cloned. In those cases where significant portions of the amino acid sequence of the
polypeptide are known, the production of labeled single or double-stranded DNA or RNA
probe sequences duplicating a sequence putatively present in the target cDNA may be
employed in DNA/DNA hybridization procedures which are carried out on cloned copies of
the cDNA which have been denatured into a single-stranded form (Jay, et al., Nucleic Acid
Research 11:2325, 1983).

In another aspect, the present invention provides a pharmaceutical composition
comprising a therapeutically effective amount of a zinc finger-nucleotide binding polypeptide
or a therapeutically effective amount of a nucleotide sequence that encodes a zinc
finger-nucleotide binding polypeptide in combination with a pharmaceutically acceptable carrier.

As used herein, the terms "pharmaceutically acceptable", "physiologically tolerable"
and grammatical variations thereof, as they refer to compositions, carriers, diluents and
reagents, are used interchangeably and represent that the materials are capable of
administration to or upon a human without the production of undesirable physiological
effects such as nausea, dizziness, gastric upset and the like which would be to a degree
that would prohibit administration of the composition.

The preparation of a pharmacological composition that contains active ingredients
dissolved or dispersed therein is well understood in the art. Typically such compositions are
prepared as sterile injectables either as liquid solutions or suspensions, aqueous or
non-aqueous; however, solid forms suitable for solution, or suspension, in liquid prior to use
can also be prepared. The preparation can also be emulsified.

The active ingredient can be mixed with excipients which are pharmaceutically
acceptable and compatible with the active ingredient and in amounts suitable for use in the
therapeutic methods described herein. Suitable excipients are, for example, water, saline,
dextrose, glycerol, ethanol or the like and combinations thereof. In addition, if desired, the
composition can contain minor amounts of auxiliary substances such as wetting or
emulsifying agents, as well as pH buffering agents and the like which enhance the
effectiveness of the active ingredient.

The therapeutic pharmaceutical composition of the present invention can include
pharmaceutically acceptable salts of the components therein. Pharmaceutically acceptable
salts include the acid addition salts (formed with the free amino groups of the polypeptide)
that are formed with inorganic acids such as, for example, hydrochloric or phosphoric acids,
or such organic acids as acetic, tartaric, mandelic and the like. Salts formed with the free
carboxyl groups can also be derived from inorganic bases such as, for example, sodium,
potassium, ammonium, calcium or ferric hydroxides, and such organic bases as
isopropylamine, trimethylamine, 2-ethylamino ethanol, histidine, procaine and the like.

Physiologically tolerable carriers are well known in the art. Exemplary of liquid
carriers are sterile aqueous solutions that contain no materials in addition to the active
ingredients and water, or contain a buffer such as sodium phosphate at physiological pH
value, physiological saline or both, such as phosphate-buffered saline. Still further,
aqueous carriers can contain more than one buffer salt, as well as salts such as sodium
and potassium chlorides, dextrose, propylene glycol, polyethylene glycol and other solutes.
Liquid compositions can also contain liquid phases in addition to and to the exclusion of
water. Exemplary of such additional liquid phases are glycerin, vegetable oils such as
cottonseed oil, organic esters such as ethyl oleate, and water-oil emulsions.

III. Compositions 

In another aspect, the present invention provides a plurality of zinc finger-nucleotide
binding polypeptides operatively linked in such a manner to specifically bind a nucleotide
target motif defined as 5'-(GNN)n-3', where n is an integer greater than 1. Preferably, n is
an integer from 2 to about 6.

Means for linking zinc finger-nucleotide binding polypeptides are described
hereinafter in the Examples as well as in United States Patent Application No. 08/676,318,
filed 1/18/1995. The individual polypeptides are preferably linked with oligopeptide linkers.
Such linkers preferably resemble the linkers that are found in naturally occurring zinc finger
proteins. A preferred linker for use in the present invention is the amino acid residue
sequence TGEKP (SEQ ID NO: 111).

To examine the efficacy of making such compositions and their use in gene control,
the human erbB-2 gene was chosen as a model. A polydactyl protein specifically
recognizing an 18-bp sequence in the 5'-untranslated region of this gene was converted into
a transcriptional repressor by fusion with KRAB, ERD, or SID repressor domains.
Transcriptional activators were generated by fusion with the herpes simplex VP16 activation
domain or with a tetrameric repeat of VP16's minimal activation domain, termed VP64. The
data show for the first time that both gene repression and activation can be achieved by
targeting designed proteins to a single site within the transcribed region of a gene.

The human erbB-2 gene was chosen as a model target for the development of zinc
finger-based transcriptional switches. Members of the ErbB receptor family play important
roles in the development of human malignancies. In particular, erbB-2 is overexpressed as
a result of gene amplification and/or transcriptional deregulation in a high percentage of
human adenocarcinomas arising at numerous sites, including breast, ovary, lung, stomach,
and salivary gland (Hynes, N. E. & Stern, D. F. (1994) Biochim. Biophys. Acta 1198,
165-184). Increased expression of ErbB-2 leads to constitutive activation of its intrinsic tyrosine
kinase, and has been shown to cause the transformation of cultured cells. Numerous
clinical studies have shown that patients bearing tumors with elevated ErbB-2 expression
levels have a poorer prognosis (Hynes, N. E. & Stern, D. F. (1994) Biochim. Biophys. Acta
1198, 165-184). In addition to its involvement in human cancer, erbB-2 plays important
biological roles, both in the adult and during embryonal development of mammals (Hynes,
N. E. & Stern, D. F. (1994) Biochim. Biophys. Acta 1198, 165-184, Altiok, N., Bessereau,
J.-L. & Changeux, J.-P. (1995) EMBO J. 14, 4258-4266, Lee, K.-F., Simon, H., Chen, H.,
Bates, B., Hung, M.-C. & Hauser, C. (1995) Nature 378, 394-398).

The erbB-2 promoter therefore represents an interesting test case for the
development of artificial transcriptional regulators. This promoter has been characterized in
detail and has been shown to be relatively complex, containing both a TATA-dependent and
a TATA-independent transcriptional initiation site (Ishii, S., Imamoto, F., Yamanashi, Y.,
Toyoshima, K. & Yamamoto, T. (1987) Proc. Natl. Acad. Sci. USA 84, 4374-4378).
Whereas early studies showed that polydactyl proteins could act as transcriptional
regulators that specifically activate or repress transcription, these proteins bound upstream
of an artificial promoter to six tandem repeats of the protein's binding site (Liu, Q., Segal, D.
J., Ghiara, J. B. & Barbas III, C. F. (1997) Proc. Natl. Acad. Sci. USA 94, 5525-5530).
Furthermore, this study utilized polydactyl proteins that were not modified in their binding
specificity. Herein, we tested the efficacy of polydactyl proteins assembled from predefined
building blocks to bind a single site in the native erbB-2 promoter. Described above is the
generation and characterization of a family of zinc finger domains that bind each of the 16
5'-GNN-3' DNA triplets. One reason we focused on the production of this family of
recognition domains is that promoter regions of most organisms are relatively GC rich in
their base content. Thus, if proteins recognizing 5'-(GNN)n-3' sites could be readily
assembled from this set of defined zinc finger domains, many genes could be rapidly and
specifically targeted for regulation. A protein containing six zinc finger domains and
recognizing 18 bp of DNA should be sufficient to define a single address within all known
genomes. Examination of the erbB-2 promoter region revealed two 5'-(GNN)6-3' sites and
one 5'-(GNN)9-3' site. One of these sites, identified here as e2c, falls within the
5'-untranslated region of the erbB-2 gene and was chosen as the target site for the generation
of a gene-specific transcriptional switch. A BLAST sequence similarity search of the
GenBank data base confirmed that this sequence is unique to erbB-2. The position of the
e2c target sequence, downstream and in the vicinity of the two major transcription initiation
sites, allowed for the examination of repression through inhibition of either transcription
initiation or elongation. An interesting feature of the e2c target site is that it is found within a
short stretch of sequence that is conserved between human, rat, and mouse erbB-2 genes
(White, M. R.-A. & Hung, M.-C. (1992) Oncogene 7, 677-683). Thus, targeting of this site
would allow for the study of this strategy in animal models prior to its application to human
disease.
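Examining a promoter for 5'-(GNN)n-3' sites amounts to a pattern search on both strands. The following is an illustrative sketch only: the function names are hypothetical, and the toy sequence embeds the e2c site between made-up flanking bases rather than reproducing the actual erbB-2 promoter.

```python
import re

COMPLEMENT = str.maketrans("ACGT", "TGCA")


def reverse_complement(seq: str) -> str:
    return seq.translate(COMPLEMENT)[::-1]


def find_gnn_sites(seq: str, n: int):
    """Return (strand, start, site) for every 5'-(GNN)n-3' run on either strand.

    A lookahead is used so that overlapping sites are all reported. Start
    offsets on the '-' strand are relative to the reverse-complemented sequence.
    """
    pattern = re.compile(r"(?=((?:G[ACGT]{2}){%d}))" % n)
    seq = seq.upper()
    hits = []
    for strand, strand_seq in (("+", seq), ("-", reverse_complement(seq))):
        for match in pattern.finditer(strand_seq):
            hits.append((strand, match.start(), match.group(1)))
    return hits


if __name__ == "__main__":
    # Toy input: the e2c site with invented flanking bases (assumption).
    toy = "TTT" + "GGGGCCGGAGCCGCAGTG" + "AGC"
    for hit in find_gnn_sites(toy, 6):
        print(hit)
```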

For generating polydactyl proteins with desired DNA-binding specificity, the present
studies have focused on the assembly of predefined zinc finger domains, which contrasts
with the sequential selection strategy proposed by Greisman and Pabo (Greisman, H. A. &
Pabo, C. O. (1997) Science 275, 657-661). Such a strategy would require the sequential
generation and selection of six zinc finger libraries for each required protein, making this
experimental approach inaccessible to most laboratories and extremely time consuming to
all. Further, since it is difficult to apply specific negative selection against binding alternative
sequences in this strategy, proteins may result that are relatively unspecific, as was recently
reported (Kim, J.-S. & Pabo, C. O. (1997) J. Biol. Chem. 272, 29795-29800).

The general utility of two different strategies for generating three-finger proteins
recognizing 9 bp of DNA sequence was investigated. Each strategy was based on the
modular nature of the zinc finger domain, and takes advantage of a family of zinc finger
domains recognizing triplets of the 5'-GNN-3' type. Two three-finger proteins recognizing
half sites (HS) 1 and 2 of the 5'-(GNN)6-3' erbB-2 target site e2c were generated in the first
strategy by fusing the pre-defined finger 2 (F2) domain variants together using a PCR
assembly strategy. To examine the generality of this approach, three additional three-finger
proteins recognizing sequences of the 5'-(GNN)3-3' type were prepared using the same
approach. Purified zinc finger proteins were prepared as fusions with the maltose binding
protein (MBP). ELISA analysis revealed that serially connected F2 proteins were able to act
in concert to specifically recognize the desired 9-bp DNA target sequences. Each of the 5
proteins shown was able to discriminate between target and non-target 5'-(GNN)3-3'
sequence.

The affinity of each of the proteins for its target was determined by electrophoretic
mobility-shift assays. These studies demonstrated that the zinc finger peptides have
affinities comparable to Zif268 and other natural transcription factors, with Kd values that
ranged from 3 to 70 nM. Here, the Kd of Zif268 for its operator was determined to be 10 nM.
It must be noted that, for reasons that remain to be explained, one group has reported Kd
values for the natural Zif268 protein that range from 6 nM to 10 pM, a 600-fold variation
(Pavletich, N. P. & Pabo, C. O. (1991) Science 252, 809-17, Greisman, H. A. & Pabo, C. O.
(1997) Science 275, 657-661). Most studies have reported the Kd of the Zif268-DNA
interaction to be from 3 to 10 nM (Choo, Y. & Klug, A. (1994) Proc. Natl. Acad. Sci. USA 91,
11163-11167, Hamilton, T. B., Borel, F. & Romaniuk, P. J. (1998) Biochemistry 37, 2051-2058).
Thus, in order to compare the results reported here with those reported elsewhere, the
relative Kds should be compared, (Mutant Kd)/(Zif268 Kd), where both values are derived
from the same report. The present data compare favorably to other studies of novel
three-finger proteins prepared using phage display where affinities 10- to 200-fold weaker than
Zif268 were reported (Greisman, H. A. & Pabo, C. O. (1997) Science 275, 657-661, Choo,
Y., Sanchez-Garcia, I. & Klug, A. (1994) Nature 372, 642-5).
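The normalization proposed above, (Mutant Kd)/(Zif268 Kd), can be illustrated with the figures given in this passage. The sketch below is illustrative only; the function name relative_kd is hypothetical, and the inputs are the 3 to 70 nM range and the 10 nM Zif268 value reported here.

```python
def relative_kd(mutant_kd_nm: float, zif268_kd_nm: float) -> float:
    """Ratio (Mutant Kd)/(Zif268 Kd); both values must come from the same report.

    A ratio above 1 means weaker binding than Zif268, below 1 means tighter.
    """
    if mutant_kd_nm <= 0 or zif268_kd_nm <= 0:
        raise ValueError("Kd values must be positive")
    return mutant_kd_nm / zif268_kd_nm


ZIF268_KD_NM = 10.0  # Kd of Zif268 for its operator as measured in this work

if __name__ == "__main__":
    for kd in (3.0, 70.0):  # the range of Kd values reported above
        print(kd, relative_kd(kd, ZIF268_KD_NM))
```

On this scale the proteins described here span ratios from 0.3 to 7, which can be set directly against the 10- to 200-fold weaker affinities cited from other reports.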

As an alternative to the serial connection of F2 domain variants, in the second strategy,
three-finger proteins specific for the two e2c 5'-(GNN)3-3' half sites were produced by "helix
grafting". The framework residues of the zinc finger domains, those residues that support
the presentation of the recognition helix, vary between proteins. We anticipated that the
framework residues may play a role in affinity and specificity. For helix grafting, amino acid
positions -2 to 6 of the DNA recognition helices were grafted into either a Zif268 (Pavletich,
N. P. & Pabo, C. O. (1991) Science 252, 809-17) or an Sp1C framework (Desjarlais, J. R. &
Berg, J. M. (1993) Proc. Natl. Acad. Sci. USA 90, 2256-60). The Sp1C protein is a designed
consensus protein shown to have enhanced stability towards chelating agents. The proteins
were expressed from DNA templates prepared by a rapid PCR-based gene assembly
strategy. In each case, ELISA analysis of MBP fusion proteins showed that the DNA binding
specificities and affinities observed with the F2 framework constructs were retained.

As discussed above, the recognition of 9 bp of DNA sequence is not sufficient to specify
a unique site within a complex genome. In contrast, a six-finger protein recognizing 18 bp of
contiguous DNA sequence could define a single site in the human genome, thus fulfilling an
important prerequisite for the generation of a gene-specific transcriptional switch. Six-finger
proteins binding the erbB-2 target sequence e2c were generated from three-finger
constructs by simple restriction enzyme digestion and cloning with F2, Zif268, and Sp1C
framework template DNAs. ELISA analysis of purified MBP fusion proteins showed that
each of the six-finger proteins was able to recognize the specific target sequence, with little
cross reactivity to non-target 5'-(GNN)6-3' sites or a tandem repeat of the Zif268 target site.
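The contrast between 9-bp and 18-bp recognition can be made concrete with a rough estimate. The sketch below assumes a random, equiprobable base composition and a haploid human genome of about 3.2 x 10^9 bp; both are simplifying assumptions introduced here for illustration and are not taken from the text.

```python
HUMAN_GENOME_BP = 3.2e9  # assumed approximate haploid genome size


def expected_random_hits(site_length_bp: int,
                         genome_size_bp: float = HUMAN_GENOME_BP,
                         both_strands: bool = True) -> float:
    """Expected chance occurrences of one specific site in a random genome."""
    strands = 2 if both_strands else 1
    return strands * genome_size_bp / 4 ** site_length_bp


if __name__ == "__main__":
    for length in (9, 18):  # three-finger versus six-finger recognition
        print(length, 4 ** length, expected_random_hits(length))
```

Under these assumptions a 9-bp site is expected to recur many thousands of times by chance, while the expectation for an 18-bp site falls below one, consistent with the argument made above.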

The affinity of each protein for the e2c DNA target site was determined by gel-shift
analysis. A modest Kd value of 25 nM was observed with the E2C(F2) six-finger protein
constructed from the F2 framework, a value that is only 2 to 3 times better than its
constituent three-finger proteins. In our previous studies of six-finger proteins, we observed
approximately 70-fold enhanced affinity of the six-finger proteins for their DNA ligand as
compared to their three-finger constituents (Liu, Q., Segal, D. J., Ghiara, J. B. & Barbas III,
C. F. (1997) Proc. Natl. Acad. Sci. USA 94, 5525-5530). The absence of a substantial
increase in the affinity of the E2C(F2) peptide suggested that serial connection of F2
domains is not optimal. It is possible that the periodicity of the F2 domains of the six-finger
protein does not match that of the DNA over this extended sequence, and that a significant
fraction of the binding energy of this protein is spent in unwinding DNA (Shi, Y. & Berg, J.
M. (1996) Biochemistry 35, 3845-8). In contrast to the F2 domain protein, the E2C(Zif) and
E2C(Sp1) six-finger proteins displayed 40- to 70-fold increased affinity as compared to their
original three-finger protein constituents, with Kd values of 1.6 nM and 0.5 nM, respectively.
Significantly, both three-finger components of these proteins were involved in binding, since
mutation of either half-site led to a roughly 100-fold decrease in affinity. The preponderance
of known transcription factors bind their specific DNA ligands with nanomolar affinity,
suggesting that the control of gene expression is governed by protein/DNA complexes of
unexceptional lifetimes. Thus, zinc finger proteins of increased affinity should not be
required and could be disadvantageous, especially if binding to non-specific DNA is also
increased.

The zinc finger domain is generally considered to be modular in nature, with each finger 
recognizing a 3-bp subsite (Pavletich, N. P. & Pabo, C. O. (1991) Science 252, 809-17). 
This is supported by our ability to recombine zinc finger domains in any desired sequence, 
yielding polydactyl proteins recognizing extended sequences of the structure 5'-(GNN)n-3'. 
However, it should be noted that at least in some cases, zinc finger domains appear to 
specify overlapping 4 bp sites rather than individual 3 bp sites. In Zif268, residues in 
addition to those found at helix positions -1, 3, and 6 are involved in contacting DNA 
(Elrod-Erickson, M., Rould, M. A., Nekludova, L. & Pabo, C. O. (1996) Structure 4, 1171-1180). 
Specifically, an aspartate in helix position 2 of F2 plays several roles in recognition and 
makes a variety of contacts. The carboxylate of the aspartate side chain hydrogen bonds 
with arginine at position -1, stabilizing its interaction with the 3'-guanine of its target site. 
This aspartate also participates in water-mediated contacts with the guanine's 
complementary cytosine. In addition, this carboxylate is observed to make a direct contact 
to the N4 of the cytosine base on the opposite strand of the 5'-guanine base of the finger 1 
binding site. It is this interaction which is the chemical basis for target site overlap. Indeed, 
when the Zif268 F2 libraries were selected against the four 5'-GCG GNG GCG-3' 
sequences, both an arginine at position -1 and an aspartate at position 2 were obtained, 
analogous to the residues in native Zif268. Since the e2c target sequence (5'-GGG GCC 
GGA GCC GCA GTG-3') (SEQ ID NO:112) is followed by an A rather than a G, a potential 
target site overlap problem was anticipated with finger 1 of an e2c-specific six-finger 
protein. However, in both the Zif- and Sp1C-framework six-finger proteins, the GTG-specific 
finger 1 containing an aspartate at position 2 appears to recognize the sequences 
5'-GTGA-3' and 5'-GTGG-3' equally well, as indicated by their very similar affinities to 
target sites e2c-a and e2c-g. 
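
The modular reading of a target site described above can be sketched in a few lines of Python. This is only an illustration: the function names are hypothetical, the target string is the e2c sequence quoted in the text with spaces removed, and the numbering assumes, as the text states for the GTG-specific finger 1, that finger 1 reads the 3'-most triplet.

```python
# Illustrative sketch: split an 18-bp target into 3-bp subsites and assign fingers.
# Function names are hypothetical; the sequence is the e2c target quoted in the text.

E2C_TARGET = "GGGGCCGGAGCCGCAGTG"  # 5'-GGG GCC GGA GCC GCA GTG-3'


def subsites(target):
    """Return the consecutive 3-bp subsites of a target, read 5' to 3'."""
    if len(target) % 3 != 0:
        raise ValueError("target length must be a multiple of 3")
    return [target[i:i + 3] for i in range(0, len(target), 3)]


def finger_assignment(target):
    """Map finger number to subsite, with finger 1 on the 3'-most triplet."""
    triplets = subsites(target)
    n = len(triplets)
    return {n - i: triplet for i, triplet in enumerate(triplets)}


def overlap_site(target, following_base):
    """4-bp site seen by finger 1: its triplet plus the base that follows the target."""
    return finger_assignment(target)[1] + following_base


if __name__ == "__main__":
    for finger, triplet in sorted(finger_assignment(E2C_TARGET).items()):
        print(f"finger {finger}: 5'-{triplet}-3'")
    # The two flanking contexts discussed in the text (target sites e2c-a and e2c-g).
    print(overlap_site(E2C_TARGET, "A"), overlap_site(E2C_TARGET, "G"))
```

Every subsite of this target begins with G, which is what allows the site to be addressed entirely with GNN-specific domains.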

A polynucleotide or composition of this invention as set forth above, can be operatively 
linked to one or more transcription modulating factors. Modulating factors such as 
transcription activators or transcription suppressors or repressors are well known in the art. 
Means for operatively linking polypeptides to such factors are also well known in the art. 
Exemplary and preferred such factors and their use to modulate gene expression are 
discussed in detail hereinafter. 



II Uses 

In one embodiment, a method of the invention includes a process for modulating 
(inhibiting or suppressing) the function of a nucleotide sequence comprising a zinc 
finger-nucleotide binding motif which comprises contacting the zinc finger-nucleotide binding motif 
with an effective amount of a zinc finger-nucleotide binding polypeptide that binds to the 
motif. In the case where the nucleotide sequence is a promoter, the method includes 
inhibiting the transcriptional transactivation of a promoter containing a zinc finger-DNA 
binding motif. The term "inhibiting" refers to the suppression of the level of activation of 
transcription of a structural gene operably linked to a promoter, containing a zinc 
finger-nucleotide binding motif, for example. In addition, the zinc finger-nucleotide binding 
polypeptide derivative may bind a motif within a structural gene or within an RNA sequence. 

The term "effective amount" includes that amount which results in the deactivation of a 
previously activated promoter or that amount which results in the inactivation of a promoter 
containing a zinc finger-nucleotide binding motif, or that amount which blocks transcription 
of a structural gene or translation of RNA. The amount of zinc finger derived-nucleotide 
binding polypeptide required is that amount necessary to either displace a native zinc 
finger-nucleotide binding protein in an existing protein/promoter complex, or that amount 
necessary to compete with the native zinc finger-nucleotide binding protein to form a 
complex with the promoter itself. Similarly, the amount required to block a structural gene 
or RNA is that amount which binds to and blocks RNA polymerase from reading through on 
the gene or that amount which inhibits translation, respectively. Preferably, the method is 
performed intracellularly. By functionally inactivating a promoter or structural gene, 
transcription or translation is suppressed. Delivery of an effective amount of the inhibitory 
protein for binding to or "contacting" the cellular nucleotide sequence containing the zinc 
finger-nucleotide binding protein motif, can be accomplished by one of the mechanisms 
described herein, such as by retroviral vectors or liposomes, or other methods well known in 
the art. 

The term "modulating" refers to the suppression, enhancement or induction of a 
function. For example, the zinc finger-nucleotide binding polypeptide of the invention may 
modulate a promoter sequence by binding to a motif within the promoter, thereby 
enhancing or suppressing transcription of a gene operatively linked to the promoter 
nucleotide sequence. Alternatively, modulation may include inhibition of transcription of a 
gene where the zinc finger-nucleotide binding polypeptide binds to the structural gene and 
blocks DNA dependent RNA polymerase from reading through the gene, thus inhibiting 
transcription of the gene. The structural gene may be a normal cellular gene or an 
oncogene, for example. Alternatively, modulation may include inhibition of translation of a 
transcript. 

The promoter region of a gene includes the regulatory elements that typically lie 5' to a 
structural gene. If a gene is to be activated, proteins known as transcription factors attach 
to the promoter region of the gene. This assembly resembles an "on switch" by enabling an 
enzyme to transcribe a second genetic segment from DNA to RNA. In most cases the 
resulting RNA molecule serves as a template for synthesis of a specific protein; sometimes 
RNA itself is the final product. 

The promoter region may be a normal cellular promoter or, for example, an onco-promoter. 
An onco-promoter is generally a virus-derived promoter. For example, the long 
terminal repeat (LTR) of retroviruses is a promoter region which may be a target for a zinc 
finger binding polypeptide variant of the invention. Promoters from members of the 
Lentivirus group, which include such pathogens as human T-cell lymphotrophic virus (HTLV) 
1 and 2, or human immunodeficiency virus (HIV) 1 or 2, are examples of viral promoter 
regions which may be targeted for transcriptional modulation by a zinc finger binding 
polypeptide of the invention. 

In order to test the concept of using zinc finger proteins as gene-specific transcriptional 
regulators, the E2C(Sp1) six-finger protein was fused to a number of effector domains. 
Transcriptional repressors were generated by attaching one of three human-derived 
repressor domains to the zinc finger protein. The first repressor protein was prepared using 
the ERF repressor domain (ERD) (Sgouras, D. N., Athanasiou, M. A., Beal, G. J., Jr., 
Fisher, R. J., Blair, D. G. & Mavrothalassitis, G. J. (1995) EMBO J. 14, 4781-4793), defined 
by amino acids 473 to 530 of the ets2 repressor factor (ERF). This domain mediates the 
antagonistic effect of ERF on the activity of transcription factors of the ets family. A 
synthetic repressor was constructed by fusion of this domain to the C-terminus of the zinc 
finger protein. The second repressor protein was prepared using the Krüppel-associated 
box (KRAB) domain (Margolin, J. F., Friedman, J. R., Meyer, W. K.-H., Vissing, H., 
Thiesen, H.-J. & Rauscher III, F. J. (1994) Proc. Natl. Acad. Sci. USA 91, 4509-4513). This 
repressor domain is commonly found at the N-terminus of zinc finger proteins and 
presumably exerts its repressive activity on TATA-dependent transcription in a distance- 
and orientation-independent manner (Pengue, G. & Lania, L. (1996) Proc. Natl. Acad. Sci. 
USA 93, 1015-1020), by interacting with the RING finger protein KAP-1 (Friedman, J. R., 
Fredericks, W. J., Jensen, D. E., Speicher, D. W., Huang, X.-P., Neilson, E. G. & Rauscher 
III, F. J. (1996) Genes & Dev. 10, 2067-2078). We utilized the KRAB domain found between 
amino acids 1 and 97 of the zinc finger protein KOX1 (Margolin, J. F., Friedman, J. R., 
Meyer, W. K.-H., Vissing, H., Thiesen, H.-J. & Rauscher III, F. J. (1994) Proc. Natl. Acad. 
Sci. USA 91, 4509-4513). In this case an N-terminal fusion with the six-finger protein was 
constructed. Finally, to explore the utility of histone deacetylation for repression, amino 
acids 1 to 36 of the Mad mSIN3 interaction domain (SID) were fused to the N-terminus of 
the zinc finger protein (Ayer, D. E., Laherty, C. D., Lawrence, Q. A., Armstrong, A. P. & 
Eisenman, R. N. (1996) Mol. Cell. Biol. 16, 5772-5781). This small domain is found at the 
N-terminus of the transcription factor Mad and is responsible for mediating its transcriptional 
repression by interacting with mSIN3, which in turn interacts with the co-repressor N-CoR and 
with the histone deacetylase mRPD1 (Heinzel, T., Lavinsky, R. M., Mullen, T.-M., 
Söderström, M., Laherty, C. D., Torchia, J., Yang, W.-M., Brard, G., Ngo, S. D., et al. 
(1997) Nature 387, 43-46). To examine gene-specific activation, transcriptional activators 
were generated by fusing the zinc finger protein to amino acids 413 to 489 of the herpes 
simplex virus VP16 protein (Sadowski, I., Ma, J., Triezenberg, S. & Ptashne, M. (1988) 
Nature 335, 563-564), or to an artificial tetrameric repeat of VP16's minimal activation 
domain, DALDDFDLDML (SEQ ID NO:113) (Seipel, K., Georgiev, O. & Schaffner, W. 
(1992) EMBO J. 11, 4961-4968), termed VP64. 

Reporter constructs containing fragments of the erbB-2 promoter coupled to a luciferase 
reporter gene were generated to test the specific activities of our designed transcriptional 
regulators. The target reporter plasmid contained nucleotides -758 to -1 with respect to the 
ATG initiation codon, whereas the control reporter plasmid contained nucleotides -1571 to 
-24, thus lacking all but one nucleotide of the E2C binding site encompassed in positions -24 
to -7. Both promoter fragments displayed similar activities when transfected transiently into 
HeLa cells, in agreement with previous observations (Hudson, L. G., Ertl, A. P. & Gill, G. N. 
(1990) J. Biol. Chem. 265, 4389-4393). To test the effect of zinc finger-repressor domain 
fusion constructs on erbB-2 promoter activity, HeLa cells were transiently co-transfected 
with each of the zinc finger expression vectors and the luciferase reporter constructs (Fig. 
5A). Significant repression was observed with each construct. The ERD and SID fusion 
proteins produced approximately 50% and 80% repression, respectively. The most potent 
repressor was the KRAB fusion protein. This protein caused complete repression of erbB-2 
promoter activity. The observed residual activity was at the background level of the 
promoter-less pGL3 reporter. In contrast, none of the proteins caused significant repression 
of the control erbB-2 reporter construct lacking the E2C target site, demonstrating that 
repression is indeed mediated by specific binding of the E2C(Sp1) protein to its target site. 
Expression of a zinc finger protein lacking any effector domain resulted in weak repression, 
approximately 30%, indicating that most of the repression observed with the SID and KRAB 
constructs is caused by their effector domains, rather than by DNA-binding alone. This 
observation strongly suggests that the mechanism of repression is active inhibition of 
transcription initiation rather than of elongation. Once initiation of transcription by RNA 
polymerase II has occurred, the zinc finger protein appears to be readily displaced from the 
DNA by the action of the polymerase. 

The utility of gene-specific polydactyl proteins to mediate activation of transcription was 
investigated using the same two reporter constructs. The VP16 fusion protein was found to 
stimulate transcription approximately 5-fold, whereas the VP64 fusion protein produced a 
27-fold activation. This dramatic stimulation of promoter activity caused by a single 
VP16-based transcriptional activator is exceptional in view of the fact that the zinc finger protein 
binds in the transcribed region of the gene. This again demonstrates that mere binding of a 
zinc finger protein, even one with subnanomolar affinity, in the path of RNA polymerase 
II need not necessarily negatively affect gene expression. 

The data herein show that zinc finger proteins capable of binding novel 9- and 18-bp 
DNA target sites can be rapidly prepared using pre-defined domains recognizing 5'-GNN-3' 
sites. This information is sufficient for the preparation of 16^6, or 17 million, novel six-finger 
proteins each capable of binding 18 bp of DNA sequence. This rapid methodology for the 
construction of novel zinc finger proteins has advantages over the sequential generation 
and selection of zinc finger domains proposed by others (Greisman, H. A. & Pabo, C. O. 
(1997) Science 275, 657-661) and takes advantage of structural information that suggests 
that the potential for the target overlap problem as defined above might be avoided in 
proteins targeting 5'-GNN-3' sites. Using the complex and well studied erbB-2 promoter and 
live human cells, the data demonstrate that these proteins, when provided with the 
appropriate effector domain, can be used to provoke or activate expression and to produce 
graded levels of repression down to the level of the background in these experiments. 
These studies suggest that the KRAB domain is significantly more potent as a 
transcriptional repressor than ERD or SID domains, and that it is able to inhibit both the 
TATA-dependent and the TATA-independent transcriptional initiation of this promoter. 
These repressor domains have not previously been directly compared. The present strategy 
of using predefined zinc finger domains to construct polydactyl proteins coupled to effector 
domains has significant advantages over strategies that attempt to only repress 
transcription by competing or interfering with proteins involved in the transcription complex 
(Kim, J.-S. & Pabo, C. O. (1997) J. Biol. Chem. 272, 29795-29800, Kim, J.-S., Kim, J., 
Cepek, K. L., Sharp, P. A. & Pabo, C. O. (1997) Proc. Natl. Acad. Sci. USA 94, 3616-3620). 
Utilization of effector domains that have the potential to act over a distance should allow the 
application of these gene-switches to the regulation of uncharacterized genes and 
promoters. Since these transcriptional regulators might be prepared using our 
PCR-assembly strategy in a high-throughput fashion, we believe it is appropriate to comment on 
their potential practical applications. Novel DNA binding proteins generated in this manner 
should have potential utility in DNA-based diagnostic applications. For the study of gene 
function, we believe that the ability to both activate and repress the transcription of genes, 
at graded levels if necessary, may assist in assigning gene function. Since these proteins 
exert their control by acting in trans, functional gene knockout or activation might be 
produced in heterozygous transgenic animals. This would drastically reduce the time 
required to produce a gene knockout in a whole animal and would extend the range of 
organisms to which knockout technology might be applied. These proteins might also be 
used in gene therapy applications to inhibit the production of viral gene products or to 
activate genes involved in fighting disease. Significantly, the ease with which these proteins 
can be prepared will facilitate the testing of these ideas by the scientific community. 
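
The count of possible proteins quoted above follows from simple combinatorics: there are 16 triplets of the form GNN, and each finger position can carry any one of the corresponding domains. A minimal Python sketch of that arithmetic is given here; the names are illustrative and it assumes one pre-defined domain per GNN triplet.

```python
from itertools import product

BASES = "ACGT"

# All 5'-GNN-3' triplets: G followed by any two bases.
GNN_TRIPLETS = ["G" + a + b for a, b in product(BASES, repeat=2)]


def count_proteins(n_fingers, n_domains=len(GNN_TRIPLETS)):
    """Number of distinct n-finger proteins when each position is chosen freely."""
    return n_domains ** n_fingers


if __name__ == "__main__":
    print(len(GNN_TRIPLETS))   # number of GNN subsites
    print(count_proteins(3))   # three-finger proteins, 9-bp sites
    print(count_proteins(6))   # six-finger proteins, 18-bp sites
```

With 16 domains, the six-finger count is 16^6 = 16,777,216, which is the "17 million" figure in the text.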

The Examples that follow illustrate preferred embodiments of the present invention 
and are not limiting of the specification or claims in any way. 

EXAMPLE 1: Selection by phage display. 

Construction of zinc-finger libraries by PCR overlap extension was essentially as 
previously described (Shi, Y. & Berg, J. M. (1996) Biochemistry 35, 3845-8). Growth and 
precipitation of phage were as previously described (Pengue, G. & Lania, L. (1996) Proc. 
Natl. Acad. Sci. USA 93, 1015-1020, Friedman, J. R., Fredericks, W. J., Jensen, D. E., 
Speicher, D. W., Huang, X.-P., Neilson, E. G. & Rauscher III, F. J. (1996) Genes & Dev. 10, 
2067-2078), except that ER2537 cells (New England Biolabs) were used to propagate the 
phage and 50 µM ZnCl2 was added to the growth media. Precipitated phage were 
resuspended in Zinc Buffer A (ZBA; 10 mM Tris, pH 7.5/90 mM KCl/1 mM MgCl2/90 µM 
ZnCl2)/1% BSA/5 mM DTT. Binding reactions (500 µl: ZBA/5 mM DTT/1% Blotto 
(BioRad)/competitor oligonucleotides/4 µg sheared herring sperm DNA (Sigma)/100 µl 
filtered phage (≈10" colony forming units)) were incubated for 30 minutes at room 
temperature, prior to the addition of 72 nM biotinylated hairpin target oligonucleotide. 
Incubation continued for 3.5 hours with constant gentle mixing. Streptavidin-coated 
magnetic beads (50 µl; Dynal) were washed twice with 500 µl ZBA/1% BSA, then blocked 
with 500 µl ZBA/6% Blotto/antibody-displaying (irrelevant) phage (~10" colony forming 
units) for ≈4 hours at room temperature. At the end of the binding period, the blocking 
solution was replaced by the binding reaction and incubated 1 hour at room temperature. 
The beads were washed 10 times over a 1 hour period with 500 µl ZBA/5 mM DTT/2% 
Tween 20, then once without Tween 20. Bound phage were eluted 30 minutes with 10 
µg/µl trypsin. 

Hairpin target oligonucleotides had the sequence 5'-Biotin- 
GGACGCN'N'N'CGCGGGTTTTCCCGCGNNNGCGTCC-3' (SEQ ID NO:114), where NNN 
was the 3-nucleotide finger 2-target sequence and N'N'N' its complement. A similar 
nonbiotinylated oligonucleotide, in which the target sequence was TGG (compTGG), was 
included at 7.2 nM in every round of selection to select against contaminating parental 
phage. Two pools of nonbiotinylated oligonucleotides were also used as competitors: one 
containing all 64 possible 3-nucleotide target sequences (compNNN), the other containing 
all the GNN target sequences except for the current selection target (compGNN). These 
pools were typically used as follows: round 1, no compNNN or compGNN; round 2, 7.2 nM 
compGNN; round 3, 10.8 nM compGNN; round 4, 1.8 µM compNNN, 25 nM compGNN; round 
5, 2.7 µM compNNN, 90 nM compGNN; round 6, 2.7 µM compNNN, 250 nM compGNN; round 
7, 3.6 µM compNNN, 250 nM compGNN. 
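
As an illustration of how the hairpin targets and competitor pools described above are composed, the following Python sketch builds the hairpin sequence for a given finger 2 target and enumerates the two pools. The function names are hypothetical, and the sketch assumes that N'N'N' is written 5' to 3' as the reverse complement of NNN so that the two arms of the stem base-pair.

```python
from itertools import product

BASES = "ACGT"
_COMPLEMENT = str.maketrans("ACGT", "TGCA")


def revcomp(seq):
    """Reverse complement of a DNA sequence written 5' to 3'."""
    return seq.translate(_COMPLEMENT)[::-1]


def hairpin_target(nnn):
    """Hairpin oligonucleotide (without the 5' biotin) for a 3-nt finger 2 target."""
    if len(nnn) != 3 or set(nnn) - set(BASES):
        raise ValueError("target must be three bases from ACGT")
    # 5' arm + complement of target + stem, TTTT loop, stem + target + 3' arm
    return "GGACGC" + revcomp(nnn) + "CGCGGG" + "TTTT" + "CCCGCG" + nnn + "GCGTCC"


def comp_nnn():
    """All 64 possible 3-nucleotide targets (the compNNN pool)."""
    return ["".join(p) for p in product(BASES, repeat=3)]


def comp_gnn(current_target):
    """All GNN targets except the one currently selected for (the compGNN pool)."""
    return [t for t in comp_nnn() if t[0] == "G" and t != current_target]


if __name__ == "__main__":
    print(hairpin_target("GCA"))
    print(len(comp_nnn()), len(comp_gnn("GCA")))
```

Under the stated assumption, the first 15 nucleotides of each hairpin are the reverse complement of the last 15, so the oligonucleotide folds back on itself around the TTTT loop.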

EXAMPLE 2: Multi-target specificity assays. 

The fragment of pComb3H (Pengue, G. & Lania, L. (1996) Proc. Natl. Acad. Sci. USA 93, 
1015-1020, Heinzel, T., Lavinsky, R. M., Mullen, T.-M., Söderström, M., Laherty, C. D., 
Torchia, J., Yang, W.-M., Brard, G., Ngo, S. D., et al. (1997) Nature 387, 43-46) 
phagemid RF DNA containing the zinc-finger coding sequence was subcloned into a 
modified pMAL-c2 (New England Biolabs) bacterial expression vector and transformed into 
XL1-Blue (Stratagene). Freeze/thaw extracts containing the overexpressed maltose binding 
protein-zinc finger fusion proteins were prepared from IPTG-induced cultures using the 
Protein Fusion and Purification System (New England Biolabs). In 96-well ELISA plates, 
0.2 µg of streptavidin (Pierce) was applied to each well for 1 hour at 37°C, then washed 
twice with water. Biotinylated target oligonucleotide (0.025 µg) was applied 
similarly. ZBA/3% BSA was applied for blocking, but the wells were not washed after 
incubation. All subsequent incubations were at room temperature. Eight 2-fold serial 
dilutions of the extracts were applied in 1x binding buffer (ZBA/1% BSA/5 mM DTT/0.12 
µg/µl sheared herring sperm DNA). The samples were incubated 1 hour, followed by 10 
washes with water. Mouse anti-maltose binding protein mAb (Sigma) in ZBA/1% BSA was 
applied to the wells for 30 minutes, followed by 10 washes with water. Goat anti-mouse IgG 
mAb conjugated to alkaline phosphatase (Sigma) was applied to the wells for 30 minutes, 
followed by 10 washes with water. Alkaline phosphatase substrate (Sigma) was applied, 
and the OD405 was quantitated with SOFTmax 2.35 (Molecular Devices). 

EXAMPLE 3: Gel mobility shift assays. 

Fusion proteins were purified to >90% homogeneity using the Protein Fusion and 
Purification System (New England Biolabs), except that ZBA/5 mM DTT was used as the 
column buffer. Protein purity and concentration were determined from Coomassie blue- 
stained 15% SDS-PAGE gels by comparison to BSA standards. Target oligonucleotides 
were labeled at their 5' or 3' ends with [32P] and gel purified. Eleven 3-fold serial dilutions of 
protein were incubated in 20 µl binding reactions (1x Binding Buffer/10% glycerol/≈1 pM 
target oligonucleotide) for three hours at room temperature, then resolved on a 5% 
polyacrylamide gel in 0.5x TBE buffer. Quantitation of dried gels was performed using a 
PhosphorImager and ImageQuant software (Molecular Dynamics), and the Kd was 
determined by Scatchard analysis. 
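
For readers unfamiliar with the analysis, the following Python sketch shows one common way a Kd can be extracted from gel shift data by a Scatchard-type linear fit. It is an illustration under stated assumptions, not the procedure implemented in the software named above: the labeled DNA is taken to be at trace concentration so that free protein is approximately total protein, binding is treated as a simple 1:1 equilibrium, and the function names and the synthetic data are hypothetical.

```python
def serial_dilution(top, fold, n):
    """Concentrations of n serial dilutions starting at `top`, each `fold` lower."""
    return [top / fold ** i for i in range(n)]


def scatchard_kd(protein_conc, fraction_bound):
    """Estimate Kd from a Scatchard plot.

    For 1:1 binding with trace DNA, theta / [P] = (1 - theta) / Kd, so plotting
    theta / [P] against theta gives a straight line with slope -1 / Kd.
    """
    xs = list(fraction_bound)
    ys = [theta / p for theta, p in zip(fraction_bound, protein_conc)]
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    sxx = sum((x - mean_x) ** 2 for x in xs)
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    slope = sxy / sxx  # ordinary least-squares slope
    return -1.0 / slope


if __name__ == "__main__":
    true_kd = 0.5  # nM, used only to generate noise-free synthetic data
    conc = serial_dilution(100.0, 3, 11)            # eleven 3-fold dilutions, nM
    theta = [p / (true_kd + p) for p in conc]       # ideal fraction of DNA bound
    print(round(scatchard_kd(conc, theta), 6))
```

With real data the points scatter around the line, and the quality of the fit depends on how well the dilution series brackets the Kd.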

EXAMPLE 4: Generation of polydactyl proteins with desired DNA binding specificity. 

The studies reported here use the finger 2 (F2) variants pmGAC, pmGAG, pGCA, 
pGCC, pmGGA, pmGGC, pmGGG, and pGTG defined in the accompanying manuscript 
(Hudson, L. G., Ertl, A. P. & Gill, G. N. (1990) J. Biol. Chem. 265, 4389-4393). To generate 
DNAs encoding three-finger proteins, F2 coding regions were PCR amplified from selected 
or designed F2 variants and assembled by PCR overlap extension. Alternatively, DNAs 
encoding three-finger proteins with a Zif268 or Sp1C framework were synthesized from 8 or 
6 overlapping oligonucleotides, respectively. Sp1C framework constructs, used for all 
reporter assays described in this report, were generated as follows. In the case of 
E2C-HS1(Sp1), 0.4 pmole each of oligonucleotides SPE2-3 (5'-GCG AGC AAG GTC GCG GCA 
GTC ACT AAA AGA TTT GCC GCA CTC TGG GCA TTT ATA CGG TTT TTC ACC-3') 
(SEQ ID NO:115) and SPE2-4 (5'-GTG ACT GCC GCG ACC TTG CTC GCC ATC AAC 
GCA CTC ATA CTG GCG AGA AGC CAT ACA AAT GTC CAG AAT GTG GC-3') (SEQ ID 
NO:116) were mixed with 40 pmole each of oligonucleotides SPE2-2 (5'-GGT AAG TCC 
TTC TCT CAG AGC TCT CAC CTG GTG CGC CAC CAG CGT ACC CAC ACG GGT GAA 
AAA CCG TAT AAA TGC CCA GAG-3') (SEQ ID NO:117) and SPE2-5 (5'-ACG CAC CAG 
CTT GTC AGA GCG GCT GAA AGA CTT GCC ACA TTC TGG ACA TTT GTA TGG C-3') 
(SEQ ID NO:118) in a standard PCR mixture and cycled 25 times (30 seconds at 94°C, 30 
seconds at 60°C, 30 seconds at 72°C). An aliquot of this pre-assembly reaction was then 
amplified with 40 pmole each of the primers SPE2-1 (5'-GAG GAG GAG GAG GTG GCC 
CAG GCG GCC CTC GAG CCC GGG GAG AAG CCC TAT GCT TGT CCG GAA TGT GGT 
AAG TCC TTC TCT CAG AGC-3') (SEQ ID NO:119) and SPE2-6 (5'-GAG GAG GAG GAG 
CTG GCC GGC CTG GCC ACT AGT TTT TTT ACC GGT GTG AGT ACG TTG GTG ACG 
CAC CAG CTT GTC AGA GCG-3') (SEQ ID NO:120) using the same cycling conditions. The 
E2C-HS2(Sp1) DNA was generated in the same way, using an analogous set of 
oligonucleotides differing only in the recognition helix coding regions. All assembled 
three-finger coding regions were digested with the restriction endonuclease SfiI and cloned into 
pMal-CSS, a derivative of the bacterial expression vector pMal-C2 (New England Biolabs). 
DNAs encoding six-finger proteins with each of the different frameworks were assembled in 
pMal-CSS using XmaI and BsrFI restriction sites included in the sequences flanking the 
three-finger coding regions. Each of the zinc finger proteins was expressed in the E. coli 
strain XL1-Blue and binding properties were investigated by ELISA and gel shift analysis as 
described in the accompanying manuscript (Hudson, L. G., Ertl, A. P. & Gill, G. N. (1990) J. 
Biol. Chem. 265, 4389-4393). 
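
The pre-assembly step above relies on neighbouring oligonucleotides annealing through complementary ends. The following Python sketch checks this for the four internal oligonucleotides. It is illustrative only: the helper names are hypothetical, and the sequences are those listed for SPE2-2 to SPE2-5 with apparent character-recognition errors in the printed text resolved (for example reading "AGO" as AGC and "Cn" as CTT), which is an assumption.

```python
_COMPLEMENT = str.maketrans("ACGT", "TGCA")


def revcomp(seq):
    """Reverse complement of a DNA sequence written 5' to 3'."""
    return seq.translate(_COMPLEMENT)[::-1]


def end_overlap(left, right):
    """Length of the longest suffix of `left` that equals a prefix of `right`."""
    for k in range(min(len(left), len(right)), 0, -1):
        if left.endswith(right[:k]):
            return k
    return 0


# Oligonucleotides from Example 4, 5' to 3', spaces removed.
SPE2_2 = ("GGT AAG TCC TTC TCT CAG AGC TCT CAC CTG GTG CGC CAC CAG CGT ACC "
          "CAC ACG GGT GAA AAA CCG TAT AAA TGC CCA GAG").replace(" ", "")
SPE2_3 = ("GCG AGC AAG GTC GCG GCA GTC ACT AAA AGA TTT GCC GCA CTC TGG GCA "
          "TTT ATA CGG TTT TTC ACC").replace(" ", "")
SPE2_4 = ("GTG ACT GCC GCG ACC TTG CTC GCC ATC AAC GCA CTC ATA CTG GCG AGA "
          "AGC CAT ACA AAT GTC CAG AAT GTG GC").replace(" ", "")
SPE2_5 = ("ACG CAC CAG CTT GTC AGA GCG GCT GAA AGA CTT GCC ACA TTC TGG ACA "
          "TTT GTA TGG C").replace(" ", "")

if __name__ == "__main__":
    # SPE2-2 and SPE2-4 are sense strand; SPE2-3 and SPE2-5 are antisense.
    print(end_overlap(SPE2_2, revcomp(SPE2_3)))
    print(end_overlap(revcomp(SPE2_3), SPE2_4))
    print(end_overlap(SPE2_4, revcomp(SPE2_5)))
```

Each adjacent pair shares a complementary stretch of more than 20 nucleotides, which is consistent with annealing at the 60°C step of the cycling program.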

EXAMPLE 5: Construction of zinc finger-effector domain fusion proteins. 

For the construction of zinc finger-effector domain fusion proteins, DNAs encoding 
amino acids 473 to 530 of the ets repressor factor (ERF) repressor domain (ERD) (Sgouras, 
D. N., Athanasiou, M. A., Beal, G. J., Jr., Fisher, R. J., Blair, D. G. & Mavrothalassitis, G. J. 
(1995) EMBO J. 14, 4781-4793), amino acids 1 to 97 of the KRAB domain of KOX1 
(Margolin, J. F., Friedman, J. R., Meyer, W. K.-H., Vissing, H., Thiesen, H.-J. & Rauscher 
III, F. J. (1994) Proc. Natl. Acad. Sci. USA 91, 4509-4513), or amino acids 1 to 36 of the 
Mad mSIN3 interaction domain (SID) (Ayer, D. E., Laherty, C. D., Lawrence, Q. A., 
Armstrong, A. P. & Eisenman, R. N. (1996) Mol. Cell. Biol. 16, 5772-5781) were assembled 
from overlapping oligonucleotides using Taq DNA polymerase. The coding region for amino 
acids 413 to 489 of the VP16 transcriptional activation domain (Sadowski, I., Ma, J., 
Triezenberg, S. & Ptashne, M. (1988) Nature 335, 563-564) was PCR amplified from 
pcDNA3/C7-C7-VP16 (10). The VP64 DNA, encoding a tetrameric repeat of VP16's minimal 
activation domain, comprising amino acids 437 to 447 (Seipel, K., Georgiev, O. & 
Schaffner, W. (1992) EMBO J. 11, 4961-4968), was generated from two pairs of 
complementary oligonucleotides. The resulting fragments were fused to zinc finger coding 
regions by standard cloning procedures, such that each resulting construct contained an 
internal SV40 nuclear localization signal, as well as a C-terminal HA decapeptide tag. 
Fusion constructs were cloned in the eukaryotic expression vector pcDNA3 (Invitrogen). 

EXAMPLE 6: Construction of luciferase reporter plasmids. 

An erbB-2 promoter fragment comprising nucleotides -758 to -1, relative to the ATG 
initiation codon, was PCR amplified from human bone marrow genomic DNA with the 
TaqExpand DNA polymerase mix (Boehringer Mannheim) and cloned into pGL3basic 
(Promega), upstream of the firefly luciferase gene. A human erbB-2 promoter fragment 
encompassing nucleotides -1571 to -24, was excised from pSVOALΔ5'/erbB-2(N-N) 
(Hudson, L. G., Ertl, A. P. & Gill, G. N. (1990) J. Biol. Chem. 265, 4389-4393) by HindIII 
digestion and subcloned into pGL3basic, upstream of the firefly luciferase gene. 

EXAMPLE 7: Luciferase assays. 

For all transfections, HeLa cells were used at a confluency of 40-60%. Typically, cells 
were transfected with 400 ng reporter plasmid (pGL3-promoter constructs or, as negative 
control, pGL3basic), 50 ng effector plasmid (zinc finger constructs in pcDNA3 or, as 
negative control, empty pcDNA3), and 200 ng internal standard plasmid (phrAct-βGal) in a 
well of a 6 well dish using the lipofectamine reagent (Gibco BRL). Cell extracts were 
prepared approximately 48 hours after transfection. Luciferase activity was measured with 
luciferase assay reagent (Promega), βGal activity with Galacto-Light (Tropix), in a 
MicroLumat LB96P luminometer (EG&G Berthold). Luciferase activity was normalized to 
βGal activity. 
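
The normalization described above, and the percent repression and fold activation figures derived from it, can be expressed as a short Python sketch. The function names and the luminometer readings are hypothetical and are used only to show the arithmetic.

```python
def normalized_activity(luciferase, bgal):
    """Luciferase signal divided by the internal-standard beta-galactosidase signal."""
    if bgal <= 0:
        raise ValueError("internal standard activity must be positive")
    return luciferase / bgal


def percent_repression(with_effector, without_effector):
    """Percent reduction in normalized activity relative to the no-effector control."""
    return 100.0 * (1.0 - with_effector / without_effector)


def fold_activation(with_effector, without_effector):
    """Ratio of normalized activity with effector to the no-effector control."""
    return with_effector / without_effector


if __name__ == "__main__":
    # Hypothetical raw readings (arbitrary light units).
    control = normalized_activity(luciferase=1000.0, bgal=100.0)
    repressed = normalized_activity(luciferase=250.0, bgal=125.0)
    activated = normalized_activity(luciferase=5400.0, bgal=100.0)
    print(percent_repression(repressed, control))
    print(fold_activation(activated, control))
```

Dividing by the internal standard corrects for well-to-well differences in transfection efficiency before any comparison between effector constructs is made.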

EXAMPLE 8: Regulation of the erbB-2 gene in HeLa cells. 

The erbB-2 gene was targeted for imposed regulation. The erbB-2 gene is frequently 
overexpressed in human cancers, particularly breast and ovarian, and elevated ErbB-2 
levels correlate with a poor prognosis (N. E. Hynes and D. F. Stern, Biochim. Biophys. Acta 
1198, 165 (1994)). To regulate the native erbB-2 gene, a synthetic repressor protein, 
designated E2C-KRAB, and a transactivator protein, designated E2C-VP64, were utilized 
(R. R. Beerli, D. J. Segal, B. Dreier, C. F. Barbas, III, Proc. Natl. Acad. Sci. USA 95, 14628 
(1998)). Both proteins contain the same designed zinc finger protein E2C that recognizes 
the 18-bp DNA sequence 5'-GGG GCC GGA GGC GCA GTG-3' (SEQ ID NO:121) in the 
5'-untranslated region of the proto-oncogene erbB-2. This DNA-binding protein was 
constructed from 6 pre-defined and modular zinc finger domains (D. J. Segal, B. Dreier, R. 
R. Beerli, C. F. Barbas, III, Proc. Natl. Acad. Sci. USA 96, 2758 (1999)). The repressor 
protein contains the Kox-1 KRAB domain (J. F. Margolin et al., Proc. Natl. Acad. Sci. USA 
91, 4509 (1994)), whereas the transactivator VP64 contains a tetrameric repeat of the 
minimal activation domain (K. Seipel, O. Georgiev, W. Schaffner, EMBO J. 11, 4961 (1992)) 
derived from the herpes simplex virus protein VP16. 

A derivative of the human cervical carcinoma cell line HeLa, HeLa/tet-off, was utilized 
(M. Gossen and H. Bujard, Proc. Natl. Acad. Sci. USA 89, 5547 (1992)). Since HeLa cells 
are of epithelial origin they express ErbB-2 and are well suited for studies of erbB-2 gene 
targeting. HeLa/tet-off cells produce the tetracycline-controlled transactivator, allowing 
induction of a gene of interest under the control of a tetracycline response element (TRE) 
by removal of tetracycline or its derivative doxycycline (Dox) from the growth medium. We 
have used this system to place our transcription factors under chemical control. Thus, the 
pRevTRE/E2C-SKD and pRevTRE/E2C-VP64 plasmids were constructed (The 
E2C(Sp1)-KRAB and E2C(Sp1)-VP64 coding regions were PCR amplified from pcDNA3-based 
expression plasmids (R. R. Beerli, D. J. Segal, B. Dreier, C. F. Barbas, III, Proc. Natl. Acad. 
Sci. USA 95, 14628 (1998)) and subcloned into pRevTRE (Clontech) using BamHI and 
ClaI restriction sites, and into pMX-IRES-GFP [X. Liu et al., Proc. Natl. Acad. Sci. USA 94, 
10669 (1997)] using BamHI and NotI restriction sites. Fidelity of the PCR amplification was 
confirmed by sequencing), transfected into HeLa/tet-off cells, and 20 stable clones each 
were isolated and analyzed for Dox-dependent target gene regulation (The 
pRevTRE/E2C-KRAB and pRevTRE/E2C-VP64 constructs were transfected into the HeLa/tet-off cell line 
(M. Gossen and H. Bujard, Proc. Natl. Acad. Sci. USA 89, 5547 (1992)) using 
Lipofectamine Plus reagent (Gibco BRL). After two weeks of selection in 
hygromycin-containing medium, in the presence of 2 µg/ml Dox, stable clones were isolated and 
analyzed for Dox-dependent regulation of ErbB-2 expression. Western blots, 
immunoprecipitations, Northern blots, and flow cytometric analyses were carried out 
essentially as described [D. Graus-Porta, R. R. Beerli, N. E. Hynes, Mol. Cell. Biol. 15, 1182 
(1995)].). As a read-out of erbB-2 promoter activity, ErbB-2 protein levels were initially 
analyzed by Western blotting. A significant fraction of these clones showed regulation of 
ErbB-2 expression upon removal of Dox for 4 days, i.e. downregulation of ErbB-2 in 
E2C-KRAB clones and upregulation in E2C-VP64 clones. ErbB-2 protein levels were correlated 
with altered levels of their specific mRNA, indicating that regulation of ErbB-2 expression 
was a result of repression or activation of transcription. The additional ErbB-2 protein 
expressed in E2C-VP64 clones was indistinguishable from naturally expressed protein and 
biologically active, since epidermal growth factor (EGF) readily induced its tyrosine 
phosphorylation. The ErbB-2 levels in the E2C-KRAB clone #27, in the absence of Dox, 
were below the level of detection as was its EGF-induced tyrosine phosphorylation. 
Therefore, ErbB-2 expression was also analyzed by flow cytometry, revealing no detectable 
ErbB-2 expression in E2C-KRAB clone #27, in sharp contrast to the dramatic upregulation 
(5.6 fold) of ErbB-2 in E2C-VP64 clone #18. Thus, the extent of erbB-2 gene regulation 
ranged from total repression (E2C-KRAB clone #27) to almost 6-fold activation (E2C-VP64 
clone #18). No significant effect on the expression of the related ErbB-1 protein was 
observed, indicating that regulation of ErbB-2 expression was not a result of general down- 
or up-regulation of transcription. In contrast to the efficacy of these transcription factors that 
target 18 bps of DNA sequence using six zinc finger domains, transcriptional activators 
prepared with three zinc finger domains that bind either of the 9-bp half-sites of the E2C 
target sequence were unable to activate transcription of an erbB-2-luciferase reporter. 
These results suggest that the increased specificity and affinity of six finger proteins may be 
required to provide a dominant effect on gene regulation. 

EXAMPLE 9: Introduction of the coding regions of the E2C-KRAB and E2C-VP64
proteins into the retroviral vector pMX-IRES-GFP.

In order to express the E2C-KRAB and E2C-VP64 proteins in several other cell lines, 
their coding regions were introduced into the retroviral vector pMX-IRES-GFP. The
E2C(Sp1)-KRAB and E2C(Sp1)-VP64 coding regions were PCR amplified from pcDNA3-
based expression plasmids (R. R. Beerli, D. J. Segal, B. Dreier, C. F. Barbas, III, Proc. Natl.
Acad. Sci. USA 95, 14628 (1998)) and subcloned into pRevTRE (Clontech) using BamHI
and ClaI restriction sites, and into pMX-IRES-GFP [X. Liu et al., Proc. Natl. Acad. Sci. USA
94, 10669 (1997)] using BamHI and NotI restriction sites. Fidelity of the PCR amplification
was confirmed by sequencing. The pMX-IRES-GFP vector expresses a single bicistronic
message for the translation of the zinc finger protein and, from an internal ribosome-entry
site (IRES), the green fluorescent protein (GFP). Since both coding regions share the same
mRNA, their expression is physically linked to one another and GFP expression is an
indicator of zinc finger expression. Virus prepared from these plasmids was then used to
infect the human carcinoma cell line A431 (pMX-IRES-GFP/E2C-KRAB and
pMX-IRES-GFP/E2C-VP64 plasmids were transiently transfected into the amphotropic
packaging cell line Phoenix Ampho using Lipofectamine Plus (Gibco BRL) and, two days
later, culture supernatants
were used for infection of target cells in the presence of 8 ng/ml polybrene. Three days after
infection, cells were harvested for analysis). Three days after infection, ErbB-2 expression 
was measured by flow cytometry. Significantly, about 59% of the E2C-KRAB virus treated 
cells were essentially ErbB-2 negative, while in about 27% of the E2C-VP64 virus treated 
cells ErbB-2 levels were increased. Plotting of GFP fluorescence vs. ErbB-2 fluorescence
revealed that there were two cell populations, one with normal ErbB-2 levels that was GFP 
negative, and another with altered ErbB-2 levels that was GFP positive. Specificity of gene 
targeting was investigated by measuring the expression levels of the related ErbB-1 and 
ErbB-3 proteins. No significant alterations of these protein levels were detected, indicating 
that erbB-2 gene targeting is specific and not a non-specific result of general alterations in
gene expression or overexpression of the effector domains. The lack of any appreciable
regulation of erbB-3 is particularly remarkable since its 5'-UTR contains the 18-bp sequence
5'-GGa GCC GGA GCG GgA GTc-3' (SEQ ID NO:122), which presents only 3 mismatches to
E2C's designed target sequence (15-bp identity; lowercase letters indicate differences) (M.
H. Kraus, W. Issing, T. Miki, N. C. Popescu, S. A. Aaronson, Proc. Natl. Acad. Sci. USA 86,
9193 (1989)).
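
The mismatch count quoted for the erbB-3 sequence can be checked directly from the way the sequence is printed. The sketch below is illustrative only: it relies on the stated convention that lowercase letters mark positions differing from the E2C target, and the function name and returned field names are hypothetical.

```python
# erbB-3 5'-UTR sequence as printed above; lowercase = differs from E2C target.
ERBB3_SITE = "GGa GCC GGA GCG GgA GTc"


def analyze_site(site: str) -> dict:
    """Count marked mismatches and check that every triplet has the form GNN."""
    triplets = site.split()
    bases = "".join(triplets)
    mismatches = sum(1 for base in bases if base.islower())
    all_gnn = all(len(t) == 3 and t[0].upper() == "G" for t in triplets)
    return {
        "length": len(bases),
        "mismatches": mismatches,
        "identity": len(bases) - mismatches,
        "all_gnn": all_gnn,
    }


print(analyze_site(ERBB3_SITE))
```

For the printed sequence this reports 18 bases, 3 marked mismatches and 15 identical positions, and confirms that all six triplets begin with G.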



EXAMPLE 10: Regulation of the erbB-2 gene in non-human primate cells. 

The zinc finger target sequence within erbB-2's 5'-UTR lies within a 28-bp sequence
stretch that is conserved in many species. To investigate regulation of erbB-2 gene
expression in non-human primate cells, COS-7 fibroblasts were infected with the bicistronic
E2C-KRAB retrovirus and analyzed by flow cytometry. As in human cells, expression of the
repressor protein as indicated by the GFP marker correlated well with a loss of ErbB-2
protein. Similarly, gene targeting in murine cells was evaluated by infection of NIH/3T3 cells
with E2C-KRAB and E2C-VP64 encoding retrovirus. ErbB-2 expression levels were then
monitored by Western blotting rather than flow cytometry, due to a lack of reactivity of the
mAb with the murine ErbB-2 extracellular domain. Here again, with E2C-KRAB a complete
transcriptional knockout upon correction for infected cells was observed. However, unlike in
human cell lines, E2C-VP64-induced ErbB-2 upregulation was rather modest in NIH/3T3
cells, approximately 1.8-fold upon correction for infection efficiency. A likely explanation for
this discrepancy lies in the different structures of the human and mouse promoters. The
mouse erbB-2 promoter, unlike the human, does not contain a TATA box (M. R. White and
M. C. Hung, Oncogene 7, 677 (1992)). Transcriptional activation by VP16 is, at least in part,
mediated by its interaction with TFIID, a multi-protein complex also containing the TATA-
binding protein (C. J. Ingles, M. Shales, W. D. Cress, S. J. Triezenberg, J. Greenblatt,
Nature 351, 688 (1991)). It is therefore plausible that the E2C-VP64 protein activates
transcription less effectively in the absence of a TATA box. These data suggest that while a
DNA binding site may be conserved with respect to sequence and relative position within a
target cell, effector domains may need to be optimized for maximal efficiency due to context
effects. Nevertheless, while their potencies may differ, the artificial transcription factors
described here are capable of imposing regulation of erbB-2 gene transcription in cells
derived from different species, providing a strategy for the study of gene function in a
variety of organisms.
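
The text does not state how the Western blot values were corrected for infection efficiency. One simple possibility, shown here purely as an assumption, is a two-population mixture model in which uninfected cells keep the normal ErbB-2 level (set to 1) and the bulk signal is the weighted average of infected and uninfected cells. The function name and the numbers are hypothetical and chosen only to illustrate the arithmetic.

```python
def fold_in_infected_cells(bulk_fold: float, infected_fraction: float) -> float:
    """Solve bulk = f * x + (1 - f) * 1 for x, the fold change in infected cells.

    bulk_fold: ErbB-2 level of the whole culture relative to uninfected cells.
    infected_fraction: fraction f of cells that were infected (0 < f <= 1).
    """
    if not 0.0 < infected_fraction <= 1.0:
        raise ValueError("infected_fraction must be in (0, 1]")
    return (bulk_fold - (1.0 - infected_fraction)) / infected_fraction


# Hypothetical inputs, not measured values from the examples.
print(fold_in_infected_cells(1.4, 0.5))  # activation case
print(fold_in_infected_cells(0.5, 0.5))  # repression case
```

With half of the cells infected, a bulk level of 1.4 corresponds to roughly 1.8-fold in the infected cells, and a bulk level of 0.5 corresponds to complete loss of expression in the infected cells.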

EXAMPLE 11: Specific induction of G1 accumulation of ErbB-2 overexpressing tumor 
cells. 

Overexpression of ErbB-2 leads to constitutive activation of its intrinsic tyrosine kinase
activity (P. P. Di Fiore et al., Science 237, 178 (1987)), and it has been shown that
downregulation of ErbB-2 in tumor cells overexpressing the receptor leads to growth
inhibition (R. M. Hudziak et al., Mol. Cell. Biol. 9, 1165 (1989); J. Deshane et al., Gene Ther.
1, 332 (1994); J. M. Daly et al., Cancer Res. 57, 3804 (1997)). The mechanism of growth
inhibition appears to be that progression of the cells from the G1 to the S phase of the cell
cycle is prevented (R. M. Neve, H. Sutterluty, N. Pullen, H. A. Lane, J. M. Daly, W. Krek, N.
E. Hynes, submitted for publication). Thus, we investigated whether expression of our
designed transcriptional repressor in erbB-2 overexpressing tumor cells would lead to a G1
block. To this end, SKBR3 breast cancer cells were infected with E2C-KRAB retrovirus and
cell-cycle distribution was analyzed in relation to ErbB-2 expression levels by flow cytometry
(22). Two cell populations were observed: about 40% of the cells were not infected and had
normal ErbB-2 levels, while the infected cells, ~60%, displayed approximately 7-fold
reduced receptor levels after 3 days. Compared to cells with normal receptor levels, a
significantly larger fraction of cells with decreased ErbB-2 expression levels was in the G1
phase of the cell cycle. To ascertain that the G1 accumulation observed with SKBR3 cells
was specific for ErbB-2 overexpressing tumor cells, a similar analysis was carried out with
the T47D breast cancer cell line, which does not display elevated levels of ErbB-2 (Fig. 4B).
Indeed, when T47D cells were infected with the E2C-KRAB retrovirus and subjected to flow
cytometric analysis, cell populations with normal and reduced ErbB-2 levels were found to
display indistinguishable DNA content. Thus, our designed repressor protein is able to
specifically induce G1 accumulation of ErbB-2 overexpressing tumor cells. The ability to
inhibit cell-cycle progression, and hence inhibit growth of ErbB-2 overexpressing tumor cells,
suggests the potential of designed transcription factors for cancer gene therapy.

CLAIMS:

1. An isolated and purified zinc finger-nucleotide binding polypeptide that contains a
nucleotide binding region having the sequence of any of SEQ ID NO: 1-16.

2. A composition comprising from 2 to about 12 of the polypeptide of claim 1.

3. The composition of claim 2 containing from 2 to about 6 polypeptides.

4. The composition of claim 2 or 3 wherein the polypeptides are operatively linked.

5. The composition of any of claims 2 to 4 wherein the polypeptides are linked by a
linker having the sequence of SEQ ID NO: 111.

6. The composition of any of claims 2 to 5 wherein each of the polypeptides binds to a
different nucleotide sequence.

7. The composition of any of claims 2 to 6 that binds to a nucleotide that contains the
sequence 5'-(GNN)n-3', wherein each N is A, C, G, or T with the proviso that all
N's cannot be C and where n is 2 to 6.

8. The polypeptide of claim 1 further operatively linked to one or more transcription
regulating factors.

9. The composition of any of claims 2 to 7 further operatively linked to one or more
transcription regulating factors.

10. An isolated and purified polynucleotide that encodes the polypeptide of claim 1.

11. An isolated and purified polynucleotide that encodes the composition of any of
claims 2 to 7.

12. An expression vector containing the polynucleotide of claim 10.

13. An expression vector containing the polynucleotide of claim 11.

14. A process of regulating a nucleotide sequence that contains the sequence 5'-(GNN)n-3',
where n is an integer from 1 to 6, the process comprising exposing the nucleotide
sequence to an effective amount of the composition of any of claims 2 to 7.

15. The process of claim 14 wherein the sequence 5'-(GNN)n-3' is located in the
transcribed region of the nucleotide sequence.

16. The process of claim 14 wherein the sequence 5'-(GNN)n-3' is located in a promoter
region of the nucleotide sequence.

17. The process of claim 14 wherein the sequence 5'-(GNN)n-3' is located within an
expressed sequence tag.

18. The process of claim 14 wherein the composition is operatively linked to one or
more transcription modulating factors.

19. Medicament comprising the composition of any of claims 2 to 7.

20. Use of the composition of any of claims 2 to 7 for the manufacture of a
medicament for the treatment of a human being.

21. Use of the composition of any of claims 2 to 7 for the manufacture of a
medicament for the treatment of cancer.

Fig. 1

Target    SEQ ID   Binding-helix amino acids   Specificity
5'-3'     NO:      at positions
                   -1 1 2 3 4 5 6

GAA       (1)       Q S S N L V R              GAA (GAT)
          (17)      Q R S N L V R              GAA, GAT
          (18)      Q S G N L V R              GAN
          (19)      Q P G N L V R              GAN

GAC       (2)       D P G N L V R              GAC
          (20)      D P G N L K R              GAC, GAT

GAG       (3)       [illegible]
          (21)      R S D N L R R              GAG, GGG
          (22)      K S A N L V R              GAG, (GAT)
          (23)      R S D N L V K              GAG, (GGG)
          (24)      K S A Q L V R              UNSPEC.

GAT       (4)       T S G N L V R              GAT

GCA       (5)       Q S G D L R R              GCA, GCT
          (25)      Q S S T L V R              GTA, GCA
          (26)      Q S G T L R R              GTA, GCA/T/C
          (27)      Q P G D L V R              GCT, GCC, GCA
          (28)      Q G P D L V R              GCT, GCA
          (29)      Q A G T L M R              GTA, GCA
          (30)      Q P G T L V R              GTA, GCA
          (31)      Q G P E L V R              non-binder

GCC       (6)       D C R D L A R              GCC
          (32)      G C R E L S R              GCC
          (33)      D P S T L K R              GCC (GCA/T, GTC)
          (34)      D P S D L K R              GCC,
          (35)      D S G D L V R              GCC, GAC
          (36)      D S G E L V R              GCT, GCC
          (37)      D S G E L K R              GCT, GCC, GTC

GCG       (7)       R S D D L V K              GCG
          (38)      R L D T L G R              GNG
          (39)      R P G D L V R              GCG, GNG, GCN
          (40)      R S D T L V R              GNG
          (41)      K S A D L K R              GAG, GTG, GCT, GCC
          (42)      R S D D L V R              GAG, (GNG, GCN)
          (43)      R S D T L V K              GNG
          (44)      K S A E L K R              GCT, GCC, UNSPEC.
          (45)      K S A E L V R              GCT, GCC, UNSPEC.
          (46)      R G P E L V R              UNSPEC.
          (47)      K P G E L V R              NON-BINDER, BUT EXPR.

GCT       (8)       T S G E L V R              GCT
          (48)      S S Q T L R                GCT
          (49)      T P G E L V R              GCT
          (50)      T S G D L V R              (GCC, GCA)
          (51)      S S Q T L V R              GCT
          (52)      T S Q T L T R              GCT (GAT, GTC, GCC)
          (53)      T S G E L K R              GCT, GCC
          (54)      Q S S D L V R              GCT (GCA, GCC)
          (55)      S S G T L V R              GCC, GCT
          (56)      T P G T L V R              GCT, GTC
          (57)      T S Q D L K R              GCC, GCT
          (58)      T S G T L V R              GCT, UNSPEC.

GGA       (9)       Q R A H L E R              GGA
          (59)      Q S S H L V R              GGA
          (60)      Q S G H L V R              GGA
          (61)      Q P G H L V R              GGA, GCT

GGC       (10)      D P G H L V R              GGC
          (62)      E R S K L A R              GGC
          (63)      D P G H L A R              GGC
          (64)      Q R A K L E R              GGC
          (65)      Q S S K L V R              GGC
          (66)      D R S K L A R              GGC, GGN
          (67)      D P G K L A R              GGC, unspec.

GGG       (11)      R S D K L V R              GGG
          (68)      R S D K L T R              GGG
          (69)      R S D H L T R              GGG, GAG
          (70)      K S A K L E R              [illegible]

GGT       (12)      T S G H L V R              GGT, GGA
          (71)      T A D H L S R              GGT, GAT
          (72)      T A D K L S R              GGG, (GGT)
          (73)      T P G H L V R              GGT, unspec.
          (74)      T S S H L V R              unspec.
          (75)      T S G K L V R              unspec.

GTA       (13)      Q S S S L V R
          (76)      Q P G E L V R              GTA, (GCT)
          (77)      Q S G E L V R              GTA, GCA/C
          (78)      Q S G E L R R              GTA, GCA/T/C

GTC       (14)      D P G A L V R
          (79)      D P G S L V R              GTC (GCT, GCC)

GTG       (15)      R S D E L V R              GTG, (GAG, GGG)
          (80)      R K D S L V R              GTG, GNG
          (81)      R S D V L V R              GTG, GAG, GGG
          (82)      [illegible]                [illegible], GNG
          (83)      R S D A L V R              GAG, GTG, GGG
          (84)      R S S S L V R              GTG
          (85)      R S S S H V R              GTG, GGG
          (86)      R S D E L V K              GTG
          (87)      R S D A L V K              GAG, GTG, GGG
          (88)      R S D V L V K              GAG, GNG
          (89)      R S S A L V R              GNG
          (90)      R K D S L V K              GGG, GNG
          (91)      R S A S L V R              GAG, unspec.
          (92)      R S D S L V R              GCT, unspec.
          (93)      R I H S L V R              unspec.
          (94)      [illegible]                UNSPEC.
          (95)      R G P S L V R              UNSPEC.
          (96)      R P G A L V R              UNSPEC.
          (97)      K S A S L V R              NON-BINDER
          (98)      K S A A L V R              NON-BINDER
          (99)      K S A V L V R              NON-BINDER

GTT       (16)      T S G S L V R              GTT,
          (100)     T S G S L T R              GGT, GCT
          (101)     T S Q S L V R              GAT, GTA, GCT, GCA
          (102)     T S S S L V R              GTA, GAT
          (103)     T P G S L V R              GTA
          (104)     T S G A L V R              GGT, GCT, GAT
          (105)     T P G A L V R              GGT, GAT, GCT
          (106)     T G G S L V R              GGT, GAT
          (107)     T S G E L V R              GCT, GCG, GTA, GTT
          (108)     T S G E L T R              GCT, GTA/T/C
          (109)     T S S A L V K              UNSPEC
          (110)     T S S A L V R              UNSPEC
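
The way the binding helices of Fig. 1 relate to a 5'-(GNN)n-3' target can be sketched as a simple lookup. The following is a minimal illustration, not a design procedure from the specification: the dictionary holds only the first-listed helix for those targets that are clearly legible in the figure as reproduced here (the entries for SEQ ID NO: 2 and 3 are left out), and the names HELIX_BY_TRIPLET and helices_for_target are hypothetical. Helices are returned in the 5' to 3' order of the triplets; the order of fingers within a protein is not addressed.

```python
import re
from typing import List

# First-listed helix per target triplet, transcribed from Fig. 1.
# Targets whose label or helix is not clearly legible are omitted.
HELIX_BY_TRIPLET = {
    "GAA": "QSSNLVR", "GAT": "TSGNLVR", "GCA": "QSGDLRR", "GCC": "DCRDLAR",
    "GCG": "RSDDLVK", "GCT": "TSGELVR", "GGA": "QRAHLER", "GGC": "DPGHLVR",
    "GGG": "RSDKLVR", "GGT": "TSGHLVR", "GTA": "QSSSLVR", "GTC": "DPGALVR",
    "GTG": "RSDELVR", "GTT": "TSGSLVR",
}


def helices_for_target(target: str) -> List[str]:
    """Return one helix per GNN triplet of a 5'-(GNN)n-3' target, n = 1 to 6."""
    seq = re.sub(r"\s+", "", target).upper()
    if not re.fullmatch(r"(G[ACGT]{2}){1,6}", seq):
        raise ValueError("target must have the form 5'-(GNN)n-3' with n from 1 to 6")
    triplets = [seq[i:i + 3] for i in range(0, len(seq), 3)]
    missing = [t for t in triplets if t not in HELIX_BY_TRIPLET]
    if missing:
        raise KeyError("no helix transcribed for: " + ", ".join(missing))
    return [HELIX_BY_TRIPLET[t] for t in triplets]


# Example input: an arbitrary 18-bp GNN string used only for illustration.
print(helices_for_target("GGA GCC GGA GCG GGA GTC"))
```

The example string contains six GNN triplets, so six helices are returned, one per triplet.
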

1. Claims: 1-21 (partial)

an isolated and purified zinc finger-nucleotide binding
polypeptide that contains a nucleotide binding region having
the sequence of SEQ ID No.: 1; compositions comprising from
2 to about 12 of isolated and purified zinc
finger-nucleotide binding polypeptides containing the
nucleotide binding regions having the sequence of any of SEQ
ID No.: 2-16 and wherein at least one of said polypeptides
contains the nucleotide binding region having the sequence
of SEQ ID No.: 1; an isolated and purified polynucleotide
encoding said polypeptide or compositions thereof;
expression vectors; a process of regulating a nucleotide
sequence that contains 5'-(GNN)n-3', where n is an integer
from 1 to 6, the process comprising exposing the nucleotide
sequence to an effective amount of these compositions;
medicament comprising these compositions and uses thereof



2. Claims: 1-21 (partial)

an isolated and purified zinc finger-nucleotide binding
polypeptide that contains a nucleotide binding region having
the sequence of SEQ ID No.: 2; compositions comprising from
2 to about 12 of isolated and purified zinc
finger-nucleotide binding polypeptides containing the
nucleotide binding regions having the sequence of any of SEQ
ID No.: 1 or 3-16 and wherein at least one of said
polypeptides contains the nucleotide binding region having
the sequence of SEQ ID No.: 2; an isolated and purified
polynucleotide encoding said polypeptide or compositions
thereof; expression vectors; a process of regulating a
nucleotide sequence that contains 5'-(GNN)n-3', where n is
an integer from 1 to 6, the process comprising exposing the
nucleotide sequence to an effective amount of these
compositions; medicament comprising these compositions and
uses thereof



3. Claims: 1-21 (partial)

an isolated and purified zinc finger-nucleotide binding
polypeptide that contains a nucleotide binding region having
the sequence of SEQ ID No.: 3; compositions comprising from
2 to about 12 of isolated and purified zinc
finger-nucleotide binding polypeptides containing the
nucleotide binding regions having the sequence of any of SEQ
ID No.: 1-2 or 4-16 and wherein at least one of said
polypeptides contains the nucleotide binding region having
the sequence of SEQ ID No.: 3; an isolated and purified
polynucleotide encoding said polypeptide or compositions
thereof; expression vectors; a process of regulating a
nucleotide sequence that contains 5'-(GNN)n-3', where n is
an integer from 1 to 6, the process comprising exposing the
nucleotide sequence to an effective amount of these
compositions; medicament comprising these compositions and
uses thereof



4. Claims: 1-21 (partial)

an isolated and purified zinc finger-nucleotide binding
polypeptide that contains a nucleotide binding region having
the sequence of SEQ ID No.: 4; compositions comprising from
2 to about 12 of isolated and purified zinc
finger-nucleotide binding polypeptides containing the
nucleotide binding regions having the sequence of any of SEQ
ID No.: 1-3 or 5-16 and wherein at least one of said
polypeptides contains the nucleotide binding region having
the sequence of SEQ ID No.: 4; an isolated and purified
polynucleotide encoding said polypeptide or compositions
thereof; expression vectors; a process of regulating a
nucleotide sequence that contains 5'-(GNN)n-3', where n is
an integer from 1 to 6, the process comprising exposing the
nucleotide sequence to an effective amount of these
compositions; medicament comprising these compositions and
uses thereof



5. Claims: 1-21 (partial)

an isolated and purified zinc finger-nucleotide binding
polypeptide that contains a nucleotide binding region having
the sequence of SEQ ID No.: 5; compositions comprising from
2 to about 12 of isolated and purified zinc
finger-nucleotide binding polypeptides containing the
nucleotide binding regions having the sequence of any of SEQ
ID No.: 1-4 or 6-16 and wherein at least one of said
polypeptides contains the nucleotide binding region having
the sequence of SEQ ID No.: 5; an isolated and purified
polynucleotide encoding said polypeptide or compositions
thereof; expression vectors; a process of regulating a
nucleotide sequence that contains 5'-(GNN)n-3', where n is
an integer from 1 to 6, the process comprising exposing the
nucleotide sequence to an effective amount of these
compositions; medicament comprising these compositions and
uses thereof



6. Claims: 1-21 (partial)

an isolated and purified zinc finger-nucleotide binding
polypeptide that contains a nucleotide binding region having
the sequence of SEQ ID No.: 6; compositions comprising from
2 to about 12 of isolated and purified zinc
finger-nucleotide binding polypeptides containing the
nucleotide binding regions having the sequence of any of SEQ
ID No.: 1-5 or 7-16 and wherein at least one of said
polypeptides contains the nucleotide binding region having
the sequence of SEQ ID No.: 6; an isolated and purified
polynucleotide encoding said polypeptide or compositions
thereof; expression vectors; a process of regulating a
nucleotide sequence that contains 5'-(GNN)n-3', where n is
an integer from 1 to 6, the process comprising exposing the
nucleotide sequence to an effective amount of these
compositions; medicament comprising these compositions and
uses thereof



7. Claims: 1-21 (partial)

an isolated and purified zinc finger-nucleotide binding
polypeptide that contains a nucleotide binding region having
the sequence of SEQ ID No.: 7; compositions comprising from
2 to about 12 of isolated and purified zinc
finger-nucleotide binding polypeptides containing the
nucleotide binding regions having the sequence of any of SEQ
ID No.: 1-6 or 8-16 and wherein at least one of said
polypeptides contains the nucleotide binding region having
the sequence of SEQ ID No.: 7; an isolated and purified
polynucleotide encoding said polypeptide or compositions
thereof; expression vectors; a process of regulating a
nucleotide sequence that contains 5'-(GNN)n-3', where n is
an integer from 1 to 6, the process comprising exposing the
nucleotide sequence to an effective amount of these
compositions; medicament comprising these compositions and
uses thereof



8. Claims: 1-21 (partial)

an isolated and purified zinc finger-nucleotide binding
polypeptide that contains a nucleotide binding region having
the sequence of SEQ ID No.: 8; compositions comprising from
2 to about 12 of isolated and purified zinc
finger-nucleotide binding polypeptides containing the
nucleotide binding regions having the sequence of any of SEQ
ID No.: 1-7 or 9-16 and wherein at least one of said
polypeptides contains the nucleotide binding region having
the sequence of SEQ ID No.: 8; an isolated and purified
polynucleotide encoding said polypeptide or compositions
thereof; expression vectors; a process of regulating a
nucleotide sequence that contains 5'-(GNN)n-3', where n is
an integer from 1 to 6, the process comprising exposing the
nucleotide sequence to an effective amount of these
compositions; medicament comprising these compositions and
uses thereof



9. Claims: 1-21 (partial)

an isolated and purified zinc finger-nucleotide binding
polypeptide that contains a nucleotide binding region having
the sequence of SEQ ID No.: 9; compositions comprising from
2 to about 12 of isolated and purified zinc
finger-nucleotide binding polypeptides containing the
nucleotide binding regions having the sequence of any of SEQ
ID No.: 1-8 or 10-16 and wherein at least one of said
polypeptides contains the nucleotide binding region having
the sequence of SEQ ID No.: 9; an isolated and purified
polynucleotide encoding said polypeptide or compositions
thereof; expression vectors; a process of regulating a
nucleotide sequence that contains 5'-(GNN)n-3', where n is
an integer from 1 to 6, the process comprising exposing the
nucleotide sequence to an effective amount of these
compositions; medicament comprising these compositions and
uses thereof



10. Claims: 1-21 (partial)

an isolated and purified zinc finger-nucleotide binding
polypeptide that contains a nucleotide binding region having
the sequence of SEQ ID No.: 10; compositions comprising from
2 to about 12 of isolated and purified zinc
finger-nucleotide binding polypeptides containing the
nucleotide binding regions having the sequence of any of SEQ
ID No.: 1-9 or 11-16 and wherein at least one of said
polypeptides contains the nucleotide binding region having
the sequence of SEQ ID No.: 10; an isolated and purified
polynucleotide encoding said polypeptide or compositions
thereof; expression vectors; a process of regulating a
nucleotide sequence that contains 5'-(GNN)n-3', where n is
an integer from 1 to 6, the process comprising exposing the
nucleotide sequence to an effective amount of these
compositions; medicament comprising these compositions and
uses thereof



11. Claims: 1-21 (partial)

an isolated and purified zinc finger-nucleotide binding
polypeptide that contains a nucleotide binding region having
the sequence of SEQ ID No.: 11; compositions comprising from
2 to about 12 of isolated and purified zinc
finger-nucleotide binding polypeptides containing the
nucleotide binding regions having the sequence of any of SEQ
ID No.: 1-10 or 12-16 and wherein at least one of said
polypeptides contains the nucleotide binding region having
the sequence of SEQ ID No.: 11; an isolated and purified
polynucleotide encoding said polypeptide or compositions
thereof; expression vectors; a process of regulating a
nucleotide sequence that contains 5'-(GNN)n-3', where n is
an integer from 1 to 6, the process comprising exposing the
nucleotide sequence to an effective amount of these
compositions; medicament comprising these compositions and
uses thereof



12. Claims: 1-21 (partial)

an isolated and purified zinc finger-nucleotide binding
polypeptide that contains a nucleotide binding region having
the sequence of SEQ ID No.: 12; compositions comprising from
2 to about 12 of isolated and purified zinc
finger-nucleotide binding polypeptides containing the
nucleotide binding regions having the sequence of any of SEQ
ID No.: 1-11 or 13-16 and wherein at least one of said
polypeptides contains the nucleotide binding region having
the sequence of SEQ ID No.: 12; an isolated and purified
polynucleotide encoding said polypeptide or compositions
thereof; expression vectors; a process of regulating a
nucleotide sequence that contains 5'-(GNN)n-3', where n is
an integer from 1 to 6, the process comprising exposing the
nucleotide sequence to an effective amount of these
compositions; medicament comprising these compositions and
uses thereof



13. Claims: 1-21 (partial)

an isolated and purified zinc finger-nucleotide binding
polypeptide that contains a nucleotide binding region having
the sequence of SEQ ID No.: 13; compositions comprising from
2 to about 12 of isolated and purified zinc
finger-nucleotide binding polypeptides containing the
nucleotide binding regions having the sequence of any of SEQ
ID No.: 1-12 or 14-16 and wherein at least one of said
polypeptides contains the nucleotide binding region having
the sequence of SEQ ID No.: 13; an isolated and purified
polynucleotide encoding said polypeptide or compositions
thereof; expression vectors; a process of regulating a
nucleotide sequence that contains 5'-(GNN)n-3', where n is
an integer from 1 to 6, the process comprising exposing the
nucleotide sequence to an effective amount of these
compositions; medicament comprising these compositions and
uses thereof



14. Claims: 1-21 (partial)

an isolated and purified zinc finger-nucleotide binding
polypeptide that contains a nucleotide binding region having
the sequence of SEQ ID No.: 14; compositions comprising from
2 to about 12 of isolated and purified zinc
finger-nucleotide binding polypeptides containing the
nucleotide binding regions having the sequence of any of SEQ
ID No.: 1-13 or 15-16 and wherein at least one of said
polypeptides contains the nucleotide binding region having
the sequence of SEQ ID No.: 14; an isolated and purified
polynucleotide encoding said polypeptide or compositions
thereof; expression vectors; a process of regulating a
nucleotide sequence that contains 5'-(GNN)n-3', where n is
an integer from 1 to 6, the process comprising exposing the
nucleotide sequence to an effective amount of these
compositions; medicament comprising these compositions and
uses thereof

15. Claims: 1-21 (partial)

an isolated and purified zinc finger-nucleotide binding
polypeptide that contains a nucleotide binding region having
the sequence of SEQ ID No.: 15; compositions comprising from
2 to about 12 of isolated and purified zinc
finger-nucleotide binding polypeptides containing the
nucleotide binding regions having the sequence of any of SEQ
ID No.: 1-14 or 16 and wherein at least one of said
polypeptides contains the nucleotide binding region having
the sequence of SEQ ID No.: 15; an isolated and purified
polynucleotide encoding said polypeptide or compositions
thereof; expression vectors; a process of regulating a
nucleotide sequence that contains 5'-(GNN)n-3', where n is
an integer from 1 to 6, the process comprising exposing the
nucleotide sequence to an effective amount of these
compositions; medicament comprising these compositions and
uses thereof



16. Claims: 1-21 (partial)

an isolated and purified zinc finger-nucleotide binding
polypeptide that contains a nucleotide binding region having
the sequence of SEQ ID No.: 16; compositions comprising from
2 to about 12 of isolated and purified zinc
finger-nucleotide binding polypeptides containing the
nucleotide binding regions having the sequence of any of SEQ
ID No.: 1-15 and wherein at least one of said polypeptides
contains the nucleotide binding region having the sequence
of SEQ ID No.: 16; an isolated and purified polynucleotide
encoding said polypeptide or compositions thereof;
expression vectors; a process of regulating a nucleotide
sequence that contains 5'-(GNN)n-3', where n is an integer
from 1 to 6, the process comprising exposing the nucleotide
sequence to an effective amount of these compositions;
medicament comprising these compositions and uses thereof